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ABSTRACT 



This document contains papers from the Annual State Data 
Conference and includes the following: (1) "Does Money Matter for Minority 

and Disadvantaged Students? Assessing the New Empirical Evidence" (David 
Grissmer, Ann Flanagan, and Stephanie Williamson) , which demonstrates that 
additional money matters for minority and disadvantaged students, but may not 
matter for highly advantaged students; (2) "Rethinking the Allocation of 
Teaching Resources: Some Lessons from High Performing Schools" (Karen Hawley 
Miles and Linda Darling -Hammond) , which shows how five schools support 
extraordinarily high student achievement by reallocating instructional 
resources to maximize individual attention for students and learning time for 
teachers; (3) "Financing Education in the District of Columbia from the 
Perspective of the Financial Authority" (Joyce Ladner) , which describes the 
District of Columbia Financial Authority efforts to revamp the ar.ea ’ s failing 
schools; (4) "Does Money Matter? An Empirical Study Introducing Resource 
Costs and Student Needs to Educational Production Function Analysis" (Corrine 
Taylor) , which shows that geographic cost variations and special -needs 
student costs do not appreciably affect per student expenditures; (5) "School 
District Expenditures, School Resources and Student Achievement: Modeling the 
Production Function" (Harold Wenglinsky) , which uses hierarchical linear 
modeling to show that expenditures on instruction and central -of f ice 
administration affect teacher-student ratios, which, in turn, affect student 
achievement; (6) "The Development of School Finance Formulas To Guarantee the 
Provision of Adequate Education of Low- Income Students" (Andrew Reschovsky 
and Jennifer Imazeki) , which explores the quandary of developing an equitable 
school finance formula; two "cutting-edge" papers, (7) "Using Cost and Need 
Adjustments To Improve the Measurement of School Finance Equity" (Lauri 
Peternick, Becky A. Smerdon, William Fowler, Jr., and David H. Mou) ; and (8) 
"A Proposal for Collecting School-Level Resource Data on the Schools and 
Staffing Survey" (Julia B. Isaacs, Michael S. Garet, and Stephen P. 

Broughman) , examine effects of applying geographic-cost adjustments and 
student-need adjustments to traditional equity measures. (Most papers include 
several references.) (MLH) 



Reproductions supplied by EDRS are the best that can be made 
from the original document. 
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One of the persistent dilemmas in education finance has been the inconsistent results obtained in 
educational research regarding the effectiveness of higher spending on student outcomes. Many of the 
scholars in the 1997 National Center for Education Statistics (NCES) Summer Conference addressed this 
issue in their presentations, and their insights and the diversity in their approaches and thinking regard- 
ing this seminal issue make this proceedings among the most captivating in the series. 

Developments in School Finance contains papers by speakers at the annual NCES Summer Conference. 
The Conference attracts several state department of education policymakers, fiscal analysts, and fiscal 
data providers from each state, who are offered fiscal training sessions and updates on developments in 
the field of education finance. The presenters are experts in their respective fields, each of who has a 
unique perspective or interesting quantitative or qualitative research regarding emerging issues in educa- 
tion finance. The reaction of those who attended the Conference was overwhelmingly positive. We hope 
that will be your response as well. 

This proceedings is the fourth education finance publication from the NCES Summer Conference. 
The papers included within present the views of the authors, and are intended to promote the exchange of 
ideas among researchers and policymakers. No official support by the U.S. Department of Education or 
NCES is intended or should be inferred. Nevertheless, NCES would be pleased if the papers provoke 
discussions, replications, replies, and refutations in future Summer Conferences. 
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William J. Fowler Jr. 
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The education finance scholars that assembled 
for the 1997 National Center for Education Statis- 
tics (NCES) Summer Conference brought with them 
diverse views of the effects of money on elemen- 
tary secondary student outcomes. Some were san- 
guine because they had suggested new or alterna- 
tive statistical or research designs from traditional 
"production function" studies that yielded empiri- 
cal evidence of the positive effects of resources on 
student achievement. Others had examined in de- 
tail how a few schools with good student outcomes 
reallocated resources, in an effort to understand how 
changes in resources are linked to student effects. 
Still others demonstrated what they believed to be 
a consistent set of evidence that disadvantaged stu- 
dents who received more resources demonstrated 
higher gain scores. At least one scholar, however, 
found a large urban school district where, as a re- 
sult of alleged mismanagement, high per-pupil 
spending was not reaching the classroom and stu- 
dent outcomes seemed below those of comparable 
school districts. Another academic sought to de- 
sign a school finance system that would provide an 
"adequate" level of education, where adequacy is 
defined in terms of minimum standards of student 
performance. 



Not all of the presentations involved finance and 
student outcomes. A few researchers were involved 
in what might be considered the "cutting edge" of 
education finance research, rather than reexamin- 
^ ing the much-debated, albeit extremely relevant. 
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question of the nexus of financing and student out- 
comes. One handful of researchers aspired to revo- 
lutionize the traditional manner of measuring fis- 
cal equity. Their approach when measuring fiscal 
equity makes adjustments to equity measures for 
differences in geographic costs and the educational 
needs of students in a school district. Another set 
of enterprising scholars endeavored to devise a 
mechanism for collecting school-level financial data 
with the next administration of the NCES Schools 
and Staffing Survey (SASS), which is scheduled to 
be administered for the fourth time in 1999-2000. 
Should they succeed, the SASS finance survey would 
represent the first collection of traditional finance 
data from a nationally representative sample of pri- 
vate schools in 20 years, and the first ever for public 
schools (at the school level). 

The first three papers presented in this collec- 
tion of the proceedings reflect "real world" exami- 
nations of finances and student outcomes, rather 
than traditional education finance research designs 
that employ sophisticated statistical analyses of 
large-scale data bases that merge thousands of 
school district finances with thousands of students' 
achievements. While many in the statistical and re- 
search community would desire statistical support 
and replication for the "real world" findings, other 
researchers point out the value of qualitative and 
case studies in gaining insights that the "black box" 
statistical analyses cannot address. 
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David Grissmer, Ann Flanagan, and Stephanie 
Williamson, from RAND, pose the intriguing ques- 
tion of whether money matters for minority and dis- 
advantaged students. They argue that evidence is 
accumulating that may replace the "Money Doesn't 
Matter" hypothesis. This new hypothesis asserts 
that additional money matters for students from less 
advantaged backgrounds and minority students, 
but may not matter for students from more highly 
advantaged backgrounds. They first explain the 
evidence they see in the NCES National Assessment 
of Educational Progress (NAEP) concerning resource 
growth and targeting. They then discuss large score 
gains of black students and class size changes. Fi- 
nally, they look to new, experimental studies, rather 
than the quasi-experimental research that has been 
the cornerstone of the traditional education finance 
findings. 

Grissmer, Flanagan, and Williamson argue that 
the widely-accepted evidence that real per-pupil 
resources doubled in education from the late 1960s 
to the early 1990s, while NAEP scores stagnated, is 
incorrect for a variety of reasons. First, disaggre- 
gating the national average scores suggests that 
scores for all racial/ ethnic groups rose in reading 
and mathematics for all age groups. Second, using 
a different deflator from the Consumer Price Index 
(CPI) suggests the real increase (that is, after infla- 
tion) in educational expenditures was much lower 
than a doubling of resources. Third, most of this 
increase went for disabled students, who are not 
included in NAEP. Fourth, of the resources not di- 
rected toward disabled students, a disproportion- 
ate amount of resources was directed at minority 
students and students in poverty. 

Since Grissmer, Flanagan, and Williamson as- 
sert that family changes can only explain one-third 
of the NAEP gains of black students, they examine 
what happened in the educational system. During 
these years, preschools and kindergartens flour- 
ished, desegregation occurred in the South, class 
sizes decreased, and teachers' age, experience, and 
education increased. Grissmer, Flanagan, and 
Williamson reject changes to preschools and kinder- 
gartens as having the kind of sustained gains in the 
test scores of black students they observe. Desegre- 
gation explains some of the gains, but not all. In- 
stead, they believe that the school changes are the 



better candidates for explaining substantial parts of 
the NAEP gains, and of those, class size decreases 
seems the strongest. 

Grissmer, Flanagan, and Williamson contend 
that class size effects have the virtue of experimen- 
tal evidence supporting their relationship just as 
quasi-experimental research techniques are being 
called into serious question. The large, multi-dis- 
trict study in Tennessee where students were ran- 
domly assigned to smaller classes found significant 
positive effects on achievement, and larger effects 
for black students. Unfortunately, in the Tennessee 
experiment, students were returned to large classes 
after third grade, so we do not know what would 
have happened if students had remained in small 
classes until the end of school. In addition, class 
sizes fell in the 1960s, as well as the 1970s. If smaller 
classes had conferred long-term benefits, 17-year- 
olds who entered school in 1968 should have 
outscored those who entered in 1960, but this did 
not occur. 

Karen Hawley Miles, an independent educa- 
tion consultant, and Linda Darling-Hammond, 
from Teachers College, Columbia University, find 
little attention has been given to rethinking the use 
of existing instructional resources, especially teach- 
ers who are schools' most important and expensive 
resource. Miles and Darling-Hammond examine 
five schools that demonstrate that it is possible to 
support student achievement at extraordinarily high 
levels by reallocating instructional resources to 
maximize individual attention for students and 
learning time for teachers. They assert that it is un- 
likely that schools can find ways to create more in- 
dividual time for students or more shared planning 
time for teachers without prohibitively raising costs, 
unless they rethink the existing reorganization of 
resources. 

Miles and Darling-Hammond suggest that they 
focus primarily on the assignment and use of teach- 
ing staff because it is the most sizable and the most 
underexplored area for potential resource realloca- 
tion. They cite studies that demonstrate that few of 
new teaching staff were deployed to reduce class 
sizes for regular education students; most went to 
provide small classes to the growing number of spe- 
cial students, or for teacher release time. Only 43 
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percent of school staff are regularly engaged in class- 
room teaching, in comparison to 60 percent or more 
in European countries, enabling more time for col- 
laborative planning and professional development. 

Miles and Darling-Hammond describe six prac- 
tices widely found in schools, and portray the im- 
pact of each on the use of teaching resources: 

• Specialized programs conducted as add-ons; 

• Isolated instruction-free time for teachers; 

• Formula-driven student assignments; 

• Fragmented high school schedules and cur- 
riculum; 

• Large high schools; and 

• Inflexible teacher work day and job definition. 

The schools attempting to change these condi- 
tions used several strategies of resource reallocation. 
Reallocation involved reduction of specialized pro- 
grams, more flexible student groupings, longer and 
varied blocks of instructional time to create more 
personalized environments, expanded common 
planning time for staff, and creative work sched- 
ules and staffing roles. 

Only two of the five schools Miles and Darling- 
Hammond studied actually reallocated and restruc- 
tured existing programs and staff, while the others 
were brand-new schools which did not suffer from 
the six standard practices found in schools. Three 
were elementary schools, and two were secondary 
schools. Traditional elementary schools served regu- 
lar education students in age-graded, self-contained 
classrooms. Three-quarters of the school's teaching 
staff worked with regular education students, the 
remainder with Title 1 and special education stu- 
dents who were pulled out of their regular classes 
for such special instruction. Class composition and 
class size stayed the same all day, for all subjects, 
except for special instruction. The elementary class- 
room teacher instructed all subjects except special- 
ties like art, music, and gym, which were taught by 
specialists during the classroom teacher's free pe- 
riod. Teachers had 45 minutes 3 to 5 times a week 
free from instruction for planning, uncoordinated 
with other teachers' free time. 

11 



The high-performing schools changed this or- 
ganization, increasing the percentage of teachers 
who worked with heterogeneous groups of students 
to 90 percent. The anomalous elementary schools 
teachers' adapted instructional grouping to student 
needs. These atypical schools kept teachers with 
the same students for 3 years, usually with the same 
homeroom class. Some teachers received as few as 
nine new students a year. All the elementary eccen- 
tric schools created more common planning time, 
although only one made dramatic changes. These 
novel elementary schools created master-teacher 
and other instructional adults in the classroom. 
Similar changes occurred in the two high schools 
studied. 



To accomplish these changes, the uncommon 
schools directly challenged policies, regulations, and 
collective bargaining agreements. Changing school 
organizations to better fit an instructional vision 
does require schools to confront a host of obstacles. 
However, the biggest constraint may be a lack of 
vision. The sample schools described by Miles and 
Darling-Hammond are intended to assist those who 
lack a vision of what may be done. 

Joyce Ladner, a member of the District of Co- 
lumbia Control Board, describes the condition of the 
District of Columbia public schools. The Control 
Board concluded that the D.C. public schools were 
in crisis, by every important educational and man- 
agement measure. As seen by the Control Board, 
the District of Columbia Public Schools (DCPS) were 
simply failing in their mission to educate the chil- 
dren of the District of Columbia, by neither provid- 
ing a quality education nor a safe environment in 
which to learn. 

The Financial Authority was created by the U.S. 
Congress in 1995 to repair the District of Columbia's 
failing financial condition and to improve the man- 
agement effectiveness of government agencies. In 
November 1997, the Authority removed the super- 
intendent and stripped the Board of Education of 
most of its power to control the schools. In their 
place, the Authority appointed a new Chief Execu- 
tive Officer and an Emergency Board of Trustees. 

Ladner describes DCPS from a report entitled 
Children in Crisis: A report on the failure of the D.C.'s 

5 
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Public Schools. The DCPS' students score signifi- 
cantly lower on standardized academic achievement 
tests than their peers in comparable districts around 
the nation. In 1994, only 22 percent of DCPS' fourth- 
grade students scored at or above the basic NAEP 
level — a decrease of 6 percent from 1992. Many stu- 
dents are dropping out or leaving DCPS (40 percent) 
for neighboring districts and private schools. DCPS 
teachers in NCES' School and Staffing Survey (SASS) 
believed that a variety of serious problems affected 
their schools, more than in other states. These prob- 
lems included student unpreparedness; disrespect 
for teachers; absenteeism; and apathy. Compared 
with the national average, more DCPS teachers and 
students report being threatened with violence. The 
infrastructure of the District's public schools is col- 
lapsing: boilers bust, roofs leak, firedoors stick, bath- 
rooms crumble, and poor security permits intrud- 
ers. The system does not know how many students 
it has, with estimates varying from 65,000 to 81,000. 

Comparisons of the District's school expendi- 
tures with other jurisdictions are difficult because 
of the above managerial irregularities, which extend 
into the fiscal area. However, the District's per-pu- 
pil expenditure exceeds the national average, and 
is substantially higher than many comparable ur- 
ban school districts and neighboring districts (one 
exception is Newark, New Jersey, which spent $2,512 
more per student than the $7,655 DCPS spent in 
1994-95). DCPS employs 16 teachers for every cen- 
tral administrator employed, compared with its 
peers who employ 42 teachers for every central ad- 
ministrator. In 1996, DCPS allocated more toward 
its Office of the Superintendent than the Fairfax 
County, Montgomery County, and Baltimore City 
public school systems combined. It also spent more 
than twice as much on the Office of the Board of 
Education than peer and neighboring district aver- 
ages. 

Ladner concludes with some of the steps of the 
new management team, such as imposing a hiring 
freeze, closing 11 schools and replacing 50 roofs, in- 
creasing security with security guards and new 
metal detectors, establishing a teacher evaluation 
program, ending social promotions, terminating a 
large contracted school maintenance contractor, and 
initiating new contracted services for school break- 
fasts and lunches. 

ERK - 



While the above studies described "real world" 
evidence of how school districts deploy staff re- 
sources, three educational researchers conducted 
more traditional production-function studies from 
large databases. These studies use econometric tech- 
niques to find relationships between educational 
outcomes to school resources, while statistically con- 
trolling for student background characteristics. In 
the first of these, Corrine Taylor, University of Wis- 
consin, argues that none of the previous studies ad- 
equately accounted for geographic cost variations 
nor the costs created by the proportion of students 
with special needs. She hypothesizes that a stron- 
ger relationship between student achievement and 
school expenditures will emerge once these costs are 
taken into account. 

Taylor employs NCES data sources: the National 
Education Longitudinal Study of 1988 (NELS); the 
Common Core of Data (CCD); and a district-level 
teacher cost index (TCI). NELS contains a nation- 
ally-representative sample of students who were fol- 
lowed at the 8th, 10th, and 12th grade level in 1988, 
1990, and 1992, who took cognitive tests, and who 
completed questionnaires about a wide variety of 
education issues and student's interest and effort in 
school. Their parents, teachers, and school admin- 
istrators also completed questionnaires regarding 
SES and school conditions. The TCI is a geographic 
cost adjustment that is an index that reflects the cost 
of employing teachers in particular regions of the 
country, based upon job and location amenities. The 
SASS provided the data to apply a regression analy- 
sis of the factors that influence teacher salaries, in- 
cluding those that are under the control of school 
districts, such as teacher experience and degree sta- 
tus, and those that are not, such as cost of living 
and quality of life. The estimates of these charac- 
teristics on teacher salaries results in an index num- 
ber, where 100 is the national norm, that can be used 
to estimate teacher costs, holding constant discre- 
tionary factors. TCI index numbers are available at 
the state, county, and school district levels (Cham- 
bers and Fowler, 1995). 

The paper does not mention the extensive work 
that is required to combine the data sets, NELS, 
CCD, and TCI. That integrating the data sets is prob- 
lematic may be deduced by Taylor's description of 
her sample of students, she includes only public 
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school students who participated in all three waves 
of the National Education Longitudinal Study of 
1988 (NELS) study (11,598 students), who never 
dropped out of school (11,503), and who attended 
the same high school in both 1990 and 1992 (11,167). 
These restrictions were necessary if one wishes to 
consider only those students who are consistently 
associated with school resources at particular 
schools. 

Taylor is very comprehensive. She examines 
three expenditures per pupil: total current; core; in- 
structional salaries. She then cost adjusts each for 
both geographic differences and student need dif- 
ferences. The geographic cost adjustment is accom- 
plished by dividing each nominal per-pupil expen- 
diture by the TCI (times 100). Student educational 
need is measured by including separate control vari- 
ables for the proportion of students in special edu- 
cation, limited English proficiency, and compensa- 
tory education, based on information from the Com- 
mon Core of Data (CCD) school district data. 

Using the 1992 mathematics score as the depen- 
dent variable, she includes prior achievement, stu- 
dent and family characteristics, student interest and 
effort, student view of the school environment, 
peer's characteristics, special needs students, com- 
munity characteristics, and school characteristics. 
Although she finds the consistently positive and sta- 
tistically significant effects of per-pupil expenditures 
on high school students' academic achievement, the 
effects do not increase appreciably when per-pupil 
costs are adjusted for geographic or student need. 
She concludes that these results demonstrate that 
the lack of a strong relationship between student 
achievement and school expenditures cannot sim- 
ply be attributed to mismeasurement of the school's 
fiscal resources. 

In yet another production function study, 
Harold Wenglinsky, of ETS, uses a different NCES 
national sample of student achievement, the Na- 
tional Assessment of Educational Progress (NAEP). 
He applies structural equations and hierarchical lin- 
ear modeling (HLM) in an attempt to address what 
he believes to be the shortcomings of standard pro- 
duction function studies. Reviewing some 30 years 
of production-function studies in education (ap- 
proximately 400 studies), he restates the imperfec- 




tions in most of the studies: the data were not na- 
tionally representative, but instead use data from a 
particular state or school district; the usual measure 
was per-pupil expenditure, rather than more dis- 
crete measures, such as administrative overhead, or 
per-pupil instructional expenditure; the process of 
schooling was not considered, which may mediate 
the relationship between expenditures and student 
outcomes; measures of student background were in- 
adequate; differences in costs caused by geography 
were not considered; especially in the early studies, 
measures of outcomes were unsophisticated, such 
as graduation rates. 

Wenglinsky attempts to remedy these problems 
in his study via differences in the nature of the data 
base he employs and the nature of the analyses un- 
dertaken. He uses a NCES data base, NAEP, that 
contains a nationally-representative sample of stu- 
dent and school information from 4th-, 8th-, and 
12th-graders, and information from their teachers 
and principals. The subject areas tested vary, but 
have included at various times mathematics, read- 
ing, history, geography, and science. Wenglinsky 
uses the 1992 mathematics assessment of students 
attending fourth grade, which contains measures of 
mathematics achievement, school environment, 
teacher education levels, teacher-student ratios, and 
student and school-level SES, He combines this data 
set with the CCD fiscal data for school districts. Note 
that this yields the same expenditure for every child 
in a school district, regardless of the school that they 
attend. Wenglinsky then adjusts these expenditures 
using the state-level cost adjustment, the TCI, rather 
than the school-district-level TCI adjustment that 
Taylor uses. For states which experience large 
within-state differences, such as New York, or Illi- 
nois, use of the state geographic cost adjustment will 
be less precise than the use of a county or school- 
district measure. 

Wenglinsky links these data bases to yield a dis- 
trict level and student level. The district level was 
produced by aggregating NAEP data to the district 
level and linking it to the district-level CCD. Since 
NAEP is a sample of students, only the 203 school 
districts that could be matched were included in the 
analysis. The district-level database was used for 
all analyses except a multi-level approach. Let us 
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explore for a moment why some multi-level ap- 
proach may be desirable. 

A common dilemma of education researchers is 
that the data often come from hierarchical data struc- 
tures. For example, expenditure data are from the 
school district level, the teachers are employed and 
conduct their classes at the school level, and stu- 
dents' achievements are at the student level. Since 
traditional statistical techniques for modeling hier- 
archy have been inadequate, the usual choice has 
been to ignore these differences, and to combine all 
characteristics at one level of aggregation. Assume 
this is done in work similar to the Wenglinsky pa- 
per. The resulting data set may be at the student 
level, with a similar expenditure for every child in 
the same school district, and a similar school envi- 
ronment measure for every child in the same school 
(or aggregated in the opposite direction, so that there 
is only a average student achievement score at the 
school district level). In reality, we know that every 
child in a school district receives different alloca- 
tions of resources, so the per-pupil expenditure 
should vary for every student, and the environmen- 
tal measures should vary for every classroom (if not 
for every student). Ignoring these limitations have 
resulted in a variety of statistical problems, which 
make such studies vulnerable to legitimate criti- 
cisms by other education researchers. 

Recent developments (such as Hierarchical Lin- 
ear Modeling (HLM)), however, have led to the de- 
velopment of several approaches to analyzing hier- 
archical data sets, in which the researcher may re- 
tain data at the appropriate level, and then run 
analyses that compare these attributes properly 
(Bryk and Raudenbush, 1992). As will be discussed 
later, NCES is exploring the possibility of collecting 
a student-level resource measure with other student- 
level data in its sample surveys. 

Returning to a discussion of the Wenglinsky 
paper, he uses two statistical analyses: LISREL, in 
which the educational researcher specifies how he 
believes each variable effects others; as well as HLM. 
Wenglinsky hypothesizes that a student's academic 
achievement is modified by the school environment 
and the teacher's highest degree and the student- 
teacher ratio, as intervening variables between the 
school district's resource choices and student 



achievement. He also examines more discrete ex- 
penditures than the total per-pupil expenditure, uti- 
lizing the school district instructional per-pupil ex- 
penditure, the central administration per-pupil ex- 
penditure, and the school administration per-pupil 
expenditure. The HLM analysis consisted of stu- 
dent achievement as the dependent variable and two 
resources (teachers' highest degree and teacher-stu- 
dent ratios) as independent variables. Wenglinsky 
finds that expenditures on instruction and central 
office administration affect teacher-student ratios, 
which, in turn, affect student achievement. The re- 
lationships also persisted when subjected to multi- 
level analysis using HLM. Interestingly, unlike Tay- 
lor, Wenglinsky finds that the relationships were af- 
fected by modifying the expenditures for geographic 
cost differences. 

Andrew Reschovsky and Jennifer Imazeki of 
the University of Wisconsin-Madison explore the 
quandary of developing a school finance formula 
that guarantees the provision of an adequate edu- 
cation to low-income students. Imazeki and 
Reschovsky recognize that the cost of education can 
be defined as the minimum amount of money that 
a school district must spend in order to achieve a 
given education outcome. In comparing two dis- 
tricts with equal spending per pupil, educational 
performance may be lower in one of the districts if 
the costs of providing any given level of education 
are higher in that district, or if that district is more 
inefficient in its use of resources. 

They stress that the importance of costs in any 
discussion of equity in the financing of schools is 
that the achievement of equity (in outcomes) will 
require higher spending in districts facing high costs. 
The courts are moving from a focus on equity in 
spending to one of educational adequacy, where ad- 
equacy is defined in terms of minimum standards 
of student performance. Imazeki and Reschovsky 
believe a prerequisite for designing a outcome-eq- 
uitable school finance system is knowledge about 
how much it will cost each school district to pro- 
vide an adequate education for its students. In their 
paper, they review traditional school aid distribu- 
tion formulas, as well as other cost measures, and 
then go on to develop their own cost index for school 
districts in Wisconsin, which takes into account stu- 
dent educational needs. They then develop a simu- 
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lation of a school aid formula designed to achieve 
education adequacy. 

The traditional way that states finance the edu- 
cation of students with special educational needs is 
by "weighting" them; that is, if a school district re- 
ceives $1,000 for a regular student, a handicapped 
student might generate $2,300 in state aid for the 
school district, or 2.3 times as much. These weights 
typically have been derived from episodic studies 
of a few school districts or states where information 
exists regarding what some school districts spend 
for the education of such children. Other geo- 
graphic cost indexes, such as McMahon or Cham- 
bers, do not consider student educational need in 
an explicit way. As such, they understate the costs 
of some school districts, since some school districts 
will have to hire more teachers (perhaps at a pre- 
mium) and spend more on non-teacher resources 
(social workers, drug counselors) in order to achieve 
any specific educational goal. Indeed, even more 
sophisticated efforts (such as Duncombe, Ruggiero, 
and Yinger, 1996) that include student need typi- 
cally measure the cost of purchasing a given set of 
inputs to be used in providing the education of stu- 
dents. 

Imazeki and Reschovsky specify a regression 
equation where student outcomes are a function of 
school resources, the characteristics of students, and 
the family and neighborhood. They consider such 
student need variables as the percent of students 
with disabilities (and severe disabilities), and the 
percent of students eligible for free and reduced- 
price lunch. They also use a "value-added" mea- 
sure of student achievement; that is, the change in 
test scores over time. Because of the complexities 
involved, Imazeki and Reschovsky decided not to 
include a measure of efficiency. As has been previ- 
ously found, they find that there is a "U-shaped" 
relationship between spending per pupil and school 
district size, and, as expected, higher proportions 
of students from poor families and those with dis- 
abilities are associated with higher costs. 

Setting the tenth-grade score at the average for 
all Wisconsin districts as the adequacy standard, 
Imazeki and Reschovsky construct a cost index by 
using the results of the regression to predict hypo- 
thetical spending for each district. These predic- 



tions are then compared to actual spending in a av- 
erage district with average costs and average levels 
of educational outcomes. They then go on to de- 
velop a state-aid formula to fund the "adequate" 
level. Surprisingly, while per pupil aid remains sub- 
stantially higher in low-property wealth districts as 
compared to high-property wealth districts, the larg- 
est percentage increases in aid go to high-wealth 
school districts. 

Let us now turn to two presentations that did 
not involve finance and its relationship to student 
outcomes. These researchers were involved in what 
might be considered the "cutting edge" of educa- 
tion finance research by examining the effect on tra- 
ditional fiscal equity measures of applying geo- 
graphic cost adjustments and student need adjust- 
ments. Lauri Peternick and Becky Smerdon, 
American Institutes for Research, William Fowler, 
NCES, and David H. Monk, Cornell University, 
were struck by the dramatic differences in the coef- 
ficient of variation (CV) when geographic cost ad- 
justments were applied to nominal per-pupil expen- 
ditures. Using New York State school districts' ex- 
penditures per pupil from the CCD, they examined 
financial equity within the state by conducting two 
sets of analyses, including and excluding New York 
City. One set of per-pupil expenditures were nomi- 
nal, another were adjusted for student needs, a third 
used the geographic cost adjustment of the TCI, and 
a fourth used both student needs and geographic 
cost. Student needs used 2.3 weights for students 
with an "individual educational plan (IEP)," and 1.2 
for students at-risk (in poverty) and limited English 
proficient (LEP). Four equity measures were ex- 
amined: the CV; the Gini coefficient; the McLoone 
Coefficient; and the slope. 

Peternick, Smerdon, Fowler, and Monk find the 
CV is greatest when measuring nominal per-pupil 
expenditures. Employing a geographic cost adjust- 
ment reduces the CV, as does the needs adjustment. 
Applying both adjustments almost halves the CV. 
The Gini is similarly affected. The McLoone Index, 
which measures equity for the lowest half of the dis- 
tribution, however, demonstrates the largest ineq- 
uity with a geographic cost adjustment. Including 
New York City in the analysis, the nominal data 
show increased equity. The opposite occurs when 
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the per-pupil expenditures are adjusted for geo- 
graphic cost differences or needs. 

The slope demonstrates the relationship be- 
tween median household income and per-pupil ex- 
penditures. Cost adjustments hinder the explana- 
tory power of median household income (or hous- 
ing value), while student needs adjustments serve 
to increase income's explanatory value. Median 
household income has a larger effect when New York 
City is included. 

Petemick, Smerdon, Fowler, and Monk conclude 
that the results presented demonstrate the varying 
impact different adjustments may have on equity 
measures. In addition, the inclusion of a single large 
urban school district may have dramatic effects 
upon the results of equity measures. 

The final article is from researchers seeking to 
address the demand for school-level resource data. 
Aside from the interest of parents and taxpayers 
about resource allocation and productivity of their 
school, there are questions of accountability and 
management, as well as equity and adequacy that 
are feeding the thirst for financial information at the 
school level. Julia Isaacs and Michael Garet of 
American Institutes for Research, and Stephen 
Broughman, NCES, are attempting to design a 
method to collect school-level financial data during 
the next administration of the NCES Schools and 
Staffing Survey (SASS), which is scheduled to be 
administered for the fourth time in 1999-2000. SASS 
provides nationally-representative and state-repre- 
sentative data about schools, and any financial data 
would permit baseline estimates of spending in the 
nation's schools. Presently, only a few states have 
accounting systems that extend to the school level, 
and the existence of more than 84,000 public schools 
in the country make it unlikely that any uniform 
reporting system would be quickly adopted. Most 
financial reporting is still contained at the school- 
district level. 

Only two extant systems are in use when school- 
level fiscal resources are reported. The most com- 
monly used is a simple extension of the existing ac- 
counting system to the school level. Coopers and 
Lybrand developed a software package of this type 



that the lay person could use to recode the school 
district budget to the school level. ,■ It contains algo- 
rithms to allocate expenditures from the school dis- 
trict level to the school for some functions (such as 
student transportation), using some information 
pertinent to the activity (such as numbers of stu- 
dent transported). Every state that has implemented 
school-level financial reporting has used the tradi- 
tional accounting system extended to the school- 
level. 

However, Isaacs, Garet, and Broughman de- 
scribe an alternative approach from their AIR col- 
leagues, called the Resource Cost Model (RCM). 
The RCM is a "bottom-up" approach to school re- 
sources, aggregating from the school-level the num- 
ber of staff in certain assignments, and the time they 
spend in certain activities. Prices are then assigned 
to each person for each assignment. In this way, the 
"service delivery" system can be described, as can 
its cost. For example, two schools may give com- 
pensatory students additional instruction: one in a 
"pull-out" service delivery system; the other by hav- 
ing an aide assist the student. As one can imagine, 
the "pull-out" delivery system, where a student is 
sent to another class with another teacher, will be 
much more expensive than the simple assistance of 
an aide. 

Isaacs, Garet, and Broughman have developed 
a proposal for collecting school-level financial data 
via a questionnaire to the school business official 
(who usually resides at the school district level). The 
business official would report school expenditures 
(if he has them), and expenditures at unspecified 
locations. These unspecified expenditures would 
then be prorated, using additional information 
needed for prorations. 

A group of education finance experts convened 
by Isaacs, Garet, and Broughman in January, 1998, 
suggested that a synthesis of the two approaches be 
attempted. Work on refining the public school ex- 
penditure instrument is still underway. 

One prospective note about a comment made 
earlier. Much of this volume revolved around the 
connection between per-pupil expenditures and stu- 
dent achievement, and the difficulties researchers 
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encountered because financial data were at a higher 
level of aggregation than the rich student-level in- 
formation that NCES obtains through its student- 
level surveys. The Education Finance Statistical 
Center (EFSC) within NCES is conducting work to 
see if a student-level resource measure can be de- 
veloped in time to accompany a longitudinal study 
of students that will begin in 1999 for kindergart- 
ners, termed the Early Childhood Longitudinal Sur- 



vey (ECLS). For the most up-to-date information 
on the work of the EFSC, NCES finance publications, 
finance graphics, and finance data sets, including 
those containing geographic cost adjustments, read- 
ers are urged to visit the web site http:// 
nces.ed.gov/edfin where readers may also email fi- 
nance questions, if they are not already answered 
in "frequently asked questions." 
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Minority and Disadvantaged Students ? 
Assessing the New Empirical Evidence 



David Grissmer 
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Until relatively recently the consensus among 
social scientists was that providing schools addi- 
tional resources would have little impact on student 
achievement — the so-called "money doesn't matter" 
thesis (Ladd, 1996). This counter-intuitive view ac- 
tually dates from the "Coleman Report" which 
found family influence strong and little effects of 
school resources (Coleman et al., 1966). Influential 
reviews by Eric Hanushek (1989, 1994, 1996) also 
argued that evidence from over 300 empirical stud- 
ies provided no consistent evidence that increased 
school resources raised achievement scores. While 
this view was consistently challenged by many edu- 
cators, policymakers, and parts of the research com- 
munity, the empirical evidence simply suggested 
otherwise. 

This scholarly consensus began to crack in the 
early 1990s. Hedges and his colleagues conducted a 
formal meta-analysis of the studies that Hanushek 
had reviewed. They found that most of these stud- 
ies lacked the statistical power to detect resource 
effects even when they were quite large. When 
Hedges and his colleagues pooled data from all 
available studies, the results indicated a positive, 
statistically significant effect and provided evidence 
that some programs may have large effects (Hedges 
et al., 1992; Hedges and Greenwald, 1996). Other 
work conducted with alternate methodologies like 



RAND 

Hierarchial Linear Modeling rather than the "pro- 
duction function" framework used in the economet- 
ric community often showed positive effects of re- 
sources. 1 

Nevertheless, Hanushek made one argument 
that was hard to rebut. Measured in constant dol- 
lars, per-pupil expenditures (PPEs) doubled be- 
tween the late 1960s and the early 1990s. Yet the 
National Assessment of Educational Progress Tests 
(NAEP) of representative samples of 9-, 13-, and 
17-year-old children seemed to show little improve- 
ment during the period when resources rose so rap- 
idly. The increases in reading and mathematics 
scores from the early 1970s to 1992 were between 
0.1 and 0.2 standard deviation or about 4- 5 percen- 
tile points. 

However, accumulating evidence is now chal- 
lenging both the NAEP evidence and the accuracy 
of previous empirical studies. The accumulative 
evidence is certainly sufficient to replace the 
"money doesn't matter" hypothesis with one that 
states that additional money matters for students 
from less advantaged backgrounds and minority 
students, but may not matter for students from more 
highly advantaged backgrounds. Several lines of 
research are converging toward this hypothesis. 
They include the following: 



1 For two recent examples see Gamoran, 1996 and Raudenbush, forthcoming. 

ERIC 19 



15 



Developments in School Finance, 1997 



• Re-analysis of experimental data on effect of 
class size 

• Evidence that model specifications used in 
many previous studies involving non-experi- 
mental data have been flawed 

• Evidence that from 1967-91, increases in 
available educational resources aimed at in- 
creasing regular students' achievement has 
been markedly overestimated 

• Evidence that the more limited real resources 
available to increase achievement scores from 
the late 1960s to the early 1990s was dispro- 
portionately targeted at minority and lower 
income children 

• Evidence that minority and less advantaged 
children made substantial gains in test scores 
in the 1970 to 1990 period, but more 
advantaged white students 
made only small gains 

• Evidence that the timing of 
score gains of minority chil- 
dren seem to be related to 
both the civil rights and war 
on poverty efforts as well as 
declines in class size. 

A more consistent set of evi- 
dence is now emerging which 
shows that disadvantaged students 
received the largest resource gains 
and that large score gains occurred 
among these students. We first dis- 
cuss the evidence from NAEP scores 
and the companion findings concerning resource 
growth and targeting. We then discuss several hy- 
pothesis for large score gains among blacks in the 
1970s and 1980s, and the correspondence with ex- 
perimental data on the effects on class size. Finally 
we discuss why estimates on the effects of resources 
from non-experimental data are now being seriously 



challenged, and probably have to be discounted in 
favor of the experimental data. 

Rising Resources and Rising NAEP 
Scores 

The often-quoted evidence that real per-pupil 
resources doubled in education from the late 1960s 
to early 1990s while NAEP scores stagnated is 
flawed on four accounts. First, although mean 
NAEP scores did not rise much, this was partly be- 
cause of rapid growth in the low-scoring Hispanic 
population. When disaggregated, scores for all ra- 
cial-ethnic groups rose in reading and mathematics 
for all age groups. Non-Hispanic whites scores rose 
by smaller amounts, while scores for Hispanic and 
blacks rose dramatically. Second, the real increase 
in educational expenditures was far less than the 
CPI adjusted PPE data would indicate. Use of more 
appropriate indices for adjustment 
of educational expenditures due to 
their labor intensity provides much 
smaller estimates of real growth. 
(Rothstein and Miles, 1995; Ladd, 
1996a) Third, a significant part of 
the smaller estimated increase went 
for students with learning disabili- 
ties, many of whom are not tested. 2 
A significant part also went for 
other socially desirable objectives 
that are only indirectly related to 
academic achievement. Taking into 
account better cost indices and in- 
cluding only spending which 
would have been directed at in- 
creasing achievement scores, 
Rothstein and Miles (1995) concluded that the real 
increase in per pupil spending on regular students 
was closer to 30 percent than to 100. 

Finally, the association of additional resources 
with increased test scores depends upon the distri- 
bution of the increased spending. The evidence in 
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All sides agree that a disproportionate fraction of the expenditure increase during the NAEP period was directed toward 
special education (Lankford and Wyckoff, 1996; Hanushek and Rivkin, 1997). Hanushek and Rivkin estimate that about a 
third of the increase between 1980 and 1990 was related to special education. NAEP typically excludes about 5 percent of 
students who have serious learning disabilities. However, special education counts increased from about 8 percent of all 
students in 1976-77 to about 12 percent in 1993-94. These figures imply that 7 percent of students taking the NAEP tests were 
receiving special education resources in 1994, compared to 3 percent in 1976-77. This percentage is too small to have much 
effect on NAEP trends, but it should in principle have had some positive effect. 
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Rothstein and Miles (1995) shows that a dispropor- 
tionate amount of resources was directed toward 
minority and lower income students. 3 Scores of 
minority students and lower scoring white students 
all showed large gains. The argument that additional 
resources did not matter is not applicable to these 
students. However, if significant additional re- 
sources were also directed toward advantaged stu- 
dents, the evidence would show much smaller gains, 
and the argument that "money doesn't matter" may 
apply to these students. 

NAEP Data 4 

Trends. Figure 1 shows how black and white 
17-year-olds' reading and mathematics scores 
changed between 1971 and 1996. 5 Figures 2 and 3 
show the same data for 9- and 13-year-olds. Each 
score by race is relative to the earliest test score re- 
corded, so a difference between black and white 
scores at a given year represents a change in the 
black-white score gap. The following points stand 
out: 

• The black-white gap narrowed for all ages in 
both subjects due to substantial gains in black 
students' scores while white students regis- 
tered smaller gains. 



• The black-white gap narrowed the most for 
13- and 17-year-olds due to dramatic increases 
in black scores from the late 1970s to the late 
1980s when black gains were 0.6 to 0.7 stan- 
dard deviation above white gains. 

For 13- and 17-year-olds, the gap stabilized or 
widened in the 1990s due to significant declines in 
black reading scores and stable black mathematics 
scores, while white mathematics scores were increas- 
ing. By 1996 black students' gains were between 
0.2 and 0.6 standard deviations greater than white 
students' gains. The black- white gap for 9-year-olds 
narrowed by 0.25 to 0.35 standard deviation by 1996. 
The pattern of gains among this group is quite dif- 
ferent from that of black adolescents, and the pat- 
tern also differs somewhat for reading and math- 
ematics. Black 9-year-olds gained more than older 
blacks during the 1970s and gained less than older 
blacks during the 1980s. Although reading scores 
among black 9-year-olds show declines after 1988, 
unlike adolescent reading scores, they have returned 
to 1988 levels. Additionally, mathematics scores con- 
tinued increasing after 1988 among this cohort. 

It is important to stress that even when black 
gains were largest, they never came close to elimi- 
nating the black-white gap. The largest reduction 
in the gap was for 17-year-olds' reading scores be- 
tween 1971 and 1988. In 1971 the median black 



3 Rothstein's and Miles' data analyzed detailed data in only nine school districts. More national evidence is needed concerning 
the relative allocation of additional resources among different types of students. There is little doubt that many of the new 
programs which were initiated or expanded were directed toward minority or low-income children. These included 
compensatory education programs such as Title 1 and HEADSTART, efforts within states to change to more equitable funding 
formulas and desegregation initiatives. However, funding also may have increased for advantaged students. More direct 
evidence is needed from school-district-level analysis of funding trends for high and low income districts. 

4 See Cambell et al., 1994 and Cambell et al., 1996 for descriptions of the NAEP data and further references. 

5 The scores have been converted to relative scores by assuming the earliest test score for each race is zero. Thus, the difference 
in scores reflects changes in the black-white gap from the earliest test. The scores are converted to standard deviation units by 
taking the mean score difference from the earliest test and dividing by a metric that remains constant over the period — the 
standard deviations of all students for the earliest year. Another common practice is to measure the gap with respect to the 
standard deviation in the same year. Since the standard deviation for all students declines for mathematics scores, but 
increases for reading scores this method changes the metric over time and would result in a somewhat different measure of 
gap reduction. 

The 1973 and 1971 scores for non-Hispanic white students were estimated because the only published scores are for combined 
Hispanic and non-Hispanic white students in those years. Tests after 1973 have separate data for Hispanic and non-Hispanic 
white students. We make a small correction in the 1971 and 1973 white data by determining the proportion by age group of 
students who were Hispanic and assuming that the difference between Hispanic and non-Hispanic white scores were the 
same in 1971 and 1973 as for the 1975 reading and 1978 mathematics tests. 

Also the 17-year-old NAEP scores reflect only students rather than all 17-year-olds. Consequently, the 17-year-old scores will 
be biased with respect to 9- and 13-year-old scores. We make a correction for 17-year-old scores using the proportion of 17- 
year-olds in school by race in 1971-73 and 1996. We assume that those not tested would have scored one-half standard 
deviation below the mean score for their respective race — probably a conservative assumption. School enrollment data from 
the October Current Population Survey (CPS) shows approximately 88 percent of white and 83 percent of black 17-year-olds 
were in school in 1970 versus 89 and 90 percent in 1988 (Cook and EvanOf^thcoming). 
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Figure 1. — NAEP mathematics and reading scores for 17-year-old students, by race/ 
ethnicity 



Standard deviation units 




SOURCE: U S. Department of Education, National Center for Education Statistics, National Assessment of Educational 
Progress (NAEP). 



Figure 2. — NAEP mathematics and reading scores for 13-year-old students, by race/ 
ethnicity 



Standard deviation units 




Mathematics scores 
for blacks 



Reading scores 
for blacks 



Mathematics scores 
for non- Hispanic 
whites 



Reading scores 
for non- Hispanic 
whites 



SOURCE: U.S. Department of Education, National Center for Education Statistics, National Assessment of Educational 
Progress (NAEP). 
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Figure 3. — NAEP mathematics and reading scores for 9-year-old students, by race/ethnicity 



Standard deviation units 




Mathematics scores 
for blacks 



Reading scores 
for blacks 



Mathematics scores 
for non-Hispanic 
whites 



Reading scores 
for non-Hispanic 
whites 



SOURCE: U.S. Department of Education, National Center for Education Statistics, National Assessment of Educational 
Progress (NAEP). 



scored between the 10th and 12 percentiles of the 
white distribution. By 1988 the median black scored 
between the 26th and 28th percentiles of the white 
distribution. For the other age groups, the gap re- 
mained even wider. 

Cohorts. Reading and mathematics are ordi- 
narily thought of as a cumulative process in which 
early gains are necessary before later gains take 
place. Charting scores by entering school cohorts 
tests for the presence this pattern. We characterize 
cohorts by the year in which they would normally 
have been in first grade, namely the year in which 
they were six years old. Each entering school co- 
hort could have taken three tests in their school ca- 
reer — at age 9, 13, and 17. Figures 4 and 5 show 
each NAEP test score between 1971 and 1996 by the 
year of school entry. The scores for each age group 
are connected so that the pattern of increase by age 
within each entering school cohort can be more eas- 
ily determined. 



The following findings stand out: 

• Black gains were small for cohorts entering 
school prior to 1968. 

• The most significant black score gains oc- 
curred for cohorts entering school from 1968 
through 1972 and 1976 through 1980. 6 After 
this period of rapid increase in both math- 
ematics and reading scores, mathematics 
scores have stabilized while reading scores 
have declined. 

• Except for 9-year-old mathematics scores, co- 
horts entering school after 1980 have regis- 
tered no further score gains. 

• Black reading gains precede mathematics 
gains. The data show small gains in reading 
for cohorts entering in the 1960s and dramatic 
gains for the 1968-72 cohorts. The mathemat- 
ics data show no evidence of gains before the 
1971 cohort. 



6 .For cohorts entering before 1968, we have no data on 9-year-olds. The position of the cohort curve is thus more uncertain for 
9-year-olds than for the 13- or 17-year-olds — especially for reading. The large gain in 9-year-olds' reading between the cohorts 
entering in 1968 and 1972 may well indicate that there was also some gain at age nine between the 1964 and 1968 cohorts. If 
that were the case, and if we had the data, it would have the effect of raising all subsequent 9-year-old points in the cohort 
graphs and make the 9-year-old patterns closer to the pattern for older groups. For mathematics, however, 9-year-olds in the 
q cohorts that entered in 1970 and 1975 scored at about the same level, making earlier gains appear less likely. 
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Figure 4. — NAEP reading scores for black students, by year of school entry 
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SOURCE: U.S. Department of Education, National Center for Education Statistics, National Assessment of Educational 
Progress (NAEP). 



Figure 5. — NAEP mathematics scores for black students, by year of school entry 
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SOURCE: U.S. Department of Education, National Center for Education Statistics, National Assessment of Educational 
Progress (NAEP). 
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While the data does show strong cohort patterns, 
it also indicates that scores can increase at later ages 
above gains achieved at earlier ages. 

Regions. The regional data shows that signifi- 
cant black gains and black-white gap reductions 
occurred in all regions for each age and subject, al- 
though some regional differences do exist (See fig- 
ure 6 and 7)7 Black score gains were somewhat 
larger in the south and west, although the reduc- 
tion in the black-white gap was fairly uniform across 
regions. 

Taken together, the NAEP data raise a number 
of questions: 

• Why did both black and white scores rise for 
all ages in both reading and mathematics? 

• Why did black scores rise substantially more 
than white scores at all ages 

and in all subjects? 



• Why were black gains 
mainly concentrated for co- 
horts entering school be- 
tween 1968-72 and 1976-80? 

• Why did older black stu- 
dents gain and then lose 
more than younger black stu- 
dents? 

• Why did black reading gains 
precede black mathematics 
gains? 



Early childhood 
interventions are 
widely thought to 
have the largest 
potential effect on 
academic achieve- 
ment, partly because 
of their influence on 
brain development. 



The most striking feature of the NAEP results 
for blacks is the size of adolescents' gains for co- 
horts entering from 1968-72 to 1976-1980. These 
gains were 0.6 standard deviation across subjects. 
Such large gains for very large national populations 
over such short time periods in tests similar to the 
NAEP are rare, if not unprecedented. Scores on IQ 
tests given to national populations seem to have in- 
creased gradually and persistently throughout the 
twentieth century, both in the United States and else- 
where (Flynn, 1987; Neisser, in press). While evi- 
dence exists for large gains on the RAVENS test 
which measures a narrower ability than tests like 
the NAEP, the gains on tests similar to the NAEP 
have averaged about 0.02 standard deviations per 
year — a fraction of the black rate in the 1980s. Nei- 
ther these gradual, persistent gains in IQ scores can- 
not be explained, nor can it be explained whether 
the gains are larger for minority or other 
subgroupings of the population 
(Flynn, 1987). But no evidence ex- 
ists in this data involving large 
populations showing gains of the 
magnitude made by black students 
over a 10-year period. 



• Why did significant black 

gains occur in all regions of the country with 
somewhat higher gains in the south and west? 

• Why were black-white gap reductions fairly 
uniform across regions? 

• Why did low-scoring students gain more in 
mathematics and less in reading than higher 
scoring students, regardless of race? 



It is even unusual to obtain 
gains of this magnitude in intensive 
programs explicitly aimed at rais- 
ing test scores. Early childhood in- 
terventions are widely thought to 
have the largest potential effect on 
academic achievement, partly be- 
cause of their influence on brain 
development. Yet only a handful 
of "model" programs have reported 
gains as large as half a standard deviation (Barnett, 
1995). These were very small-scale programs with 
intensive levels of intervention. Even when early 
childhood programs produce large initial gains, the 
effects usually diminish over time. Among blacks 
who entered school between 1968 and 1978 gains 
were very large among older students and were not 
confined to small samples, but occurred nationwide. 



The race x region data is unpublished and was provided by Michael Ross of the National Center for Education Statistics. We 
only have regional data for 1971-92. For the purpose of reporting NAEP data, the Department of Education places Texas in 
the West whereas, the U.S. Bureau of the Census places Texas in the South. This is important when interpreting black regional 
scores in the West, because a sizable proportion of these blacks are in Texas. 
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Figure 6.— Change in NAEP reading scores between 1971 and 1992, by region, race/ 
ethnicity, and age 
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SOURCE: U.S. Department of Education, National Center for Education Statistics, National Assessment of Educational 
Progress (NAEP). 
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Large changes in scores of 0.5 standard devia- 
tion and more which are sustained through older 
ages have been observed when sustained interrup- 
tions in schooling occurs at younger ages (Ceci and 
Williams, 1997). Black students typically gain about 
0.4 standard deviations per year on the NAEP tests 
between the ages of 9 and 13. In terms of "grade 
equivalents," black adolescent gains were equiva- 
lent to approximately 1.5 years of additional school- 
ing. The large black gains sustained for older stu- 
dents suggests that there may have been a major 
change in the quality of blacks' school experience 
beginning in the late 1960s. This change in school 
experiences could reflect social and legal changes 
aimed at equalizing educational opportunity, addi- 
tional educational resources that were especially 
helpful for black students, and the implementation 
of civil rights legislation creating 
new job opportunities for academi- 
cally successful blacks, which may 
have made black students more ea- 
ger to take advantage of any oppor- 
tunities their schools provided. 

However, before testing more 
specific hypothesis about changes 
in schools, we need to account for 
how changes in families may have 
affected test scores. Family charac- 
teristics account for the largest part 
of the variance in test scores in cross 
sectional models, and family char- 
acteristics changed significantly in 
this period. Thus, it is important to 
estimate how changing families 
would be expected to change achievement scores. 



Family Changes 

The available evidence would indicate that 
changes in the family would be expected to have a 
positive effect on test scores from 1970 to 1990 
(Grissmer, et al., 1994) (Cook and Evans, 1998). 
Higher parental education and smaller family size 
are the main factors leading to higher predicted test 
scores of approximately 0.2 standard deviation for 
black and white students. The sizes of the predicted 
effects are about the size of the white score gains, 
but much smaller than the score gains of blacks. 8 
While these family gains can account for nearly all 
white score gains, they can explain only approxi- 
mately one-third of black gains during the NAEP 
years. 

Therefore, we must turn to events in the educa- 
tional system: the growth of pre- 
schools and kindergartens, deseg- 
regation in the South, declines in 
class size, increases in teachers' age 
and experience and increases in the 
amount of teachers' education. 
Some of these factors would be ex- 
pected to affect scores only at cer- 
tain ages or for certain subjects or 
primarily in certain regions of the 
country, while others could poten- 
tially affect scores nationally at all 
ages in both subjects. 

Changes in Schooling 
and Educational 
Resources 

An assessment of the impacts of these factors 
on NAEP scores discounts many of them as substan- 
tial contributors to the overall black gains for all age 
groups (Grissmer et al., 1998). 9 In examining each 



Family characteristics 
account for the 
largest part of the 
variance in test 
scores in cross 
sectional models . . . 



8 Even these modest estimates of the gains attributable to improvement in family background may be too high. Consider the 
case of parental education. Parental education is correlated with children's test performance for two reasons. First, education 
changes parents in ways that make them more likely to provide their children with an environment conducive to learning. 
Second, education is a proxy for innate characteristics of parents that they pass along to their children. These innate characteristics 
also enhance children's test scores. When parents stay in school longer, their child-rearing practices probably change, but their 
innate characteristics do not. Keeping parents in school longer is, therefore, unlikely to raise children's test scores as much as 
we would expect on the basis of the cross-sectional estimates. 

9 This section presents a summary of much more detailed evidence provided in Grissmer, et al. (1998) for the size of expected 
effects from the changes in schooling and education cited in the rest of the article. 
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factor the evidence was assessed for how much a 
factor changed during the NAEP period, how many 
youth experienced the change and whether it 
changed more for blacks, how large the expected 
effects might be and whether they might be larger 
for black students, and how well the changes match 
the changes in NAEP scores. 

Kindergarten attendance also increased during 
this period because of state mandates. A proxy 
measure of increasing attendance is the percentage 
of 5-year-olds in school (either kindergarten or first 
grade). About 66 percent of the entering class in 
1960 were in K-l at age 5 versus 89 percent for the 
entering class of 1990. 10 There is also a shift toward 
full- rather than half-day attendance. In 1970 only 

12 percent of 5-year-olds attended for a full day, ver- 
sus 41 percent in 1991 . u Finally, black participation 
has increased somewhat faster than white partici- 
pation. In 1969, 78 percent of white 
and 67 percent of black 5-year-olds 
were in K-l while the percentages 
were almost equal in 1990. 12 Once 
again, strong differential effects by 
race would be required to affect the 
black-white gap. 

The empirical evidence sug- 
gests that the large growth in pre- 
school participation and kindergar- 
ten may have a limited impact on 
9-year-old scores, but would not 
significantly impact scores at ages 

13 and 17 (Barnett, 1995) (Karweit, 

1989). For pre-school the evidence 
is much stronger due to several methodologically 
strong studies. The evidence shows that small-scale, 
intensive interventions can have effects of 0.5 stan- 
dard deviation or more in the short term, but its 
effect lessons for almost all studies measured at age 
9 or older. Large-scale public programs show even 
weaker effects with similar fade-out. 



The kindergarten evidence is based on differ- 
ences between half-day and full-day attendance 
which show significant short-term effects, but simi- 
lar fade-out effects. Larger short-term effects might 
be expected from the change from no attendance to 
half-day attendance, but it would counter the avail- 
able evidence from both pre-school and full day kin- 
dergarten attendance for long-term effects to result 
from such a change. Similar to pre-school, more kin- 
dergarten participation might have residual effects 
to age 9, but no longer-term effects would be ex- 
pected. 13 

The percentage of black high school graduates 
completing a minimum set of specified courses (4 
years of English, 3 years of social science, 2 years of 
science and 2 years of mathematics) increased from 
32 percent in 1982 to 76 percent in 1994. However, 
gains for white students were similar — from 33 to 
76 percent. Changes in course work 
at the high school level in the 1980s 
may explain part of the score in- 
creases for older black and white 
students, but probably cannot ex- 
plain much of the differential black 
score gains since course work 
changes were similar for black and 
white students. 

Desegregation also appears to 
offer an explanation for a small part 
of the black score gains for all ages. 
Desegregation occurred primarily 
in the south over a short period in 
the late 1960s and early 1970s, but 
the regional NAEP data shows that black score gains 
occurred in all regions of the country. While the larg- 
est gains appear to have been made in the south, 
the extra southern gains accounts for less than 20 
percent of overall black gains. 



The empirical evidence 
suggests that the large 
growth in pre-school 
participation and 
kindergarten may have 
a limited impact on 9- 
year-old scores, but 
would not significantly 
impact scores at ages 
13 and 17 . . . 



10 Ideally, a measure of the percentage of those entering first grade who attended kindergarten is needed. By 1990, over 98 
percent of children entering school attended kindergarten. However, such data is not available for earlier years. Our measure 
does not approach 98 percent because an increasing number of children in the 1980s delayed school entry for one year and 
attended kindergarten at age 6. 

11 Data from Digest of Education Statistics, 1992, table 47. 

12 Digest of Education Statistics, 1970, table 4 and Digest of Education Statistics, 1992, table 7. 

13 Similar to preschool also, some small-scale, specially designed kindergarten programs appear to have substantial short-term 
effects (Karweit, 1989). It is possible that kindergarten curriculum has shifted with some effects at age 9 as well. 




Does Money Matter ? 



Nationwide, three significant changes took place 
in schools during this period — lower class size, and 
better educated and more experienced teachers. 
Unlike many of the changes cited above, these 
changes, would have been experienced by nearly 
all students at all ages. Thus, if such changes would 
be expected to have effects on achievement, these 
changes would better explain NAEP score gains for 
all age groups and subjects. 

Teachers' average level of experience declined 
in the early 1960s as substantial numbers of inexpe- 
rienced teachers were hired to teach the baby 
boomers. As enrollments fell in the 1970s, the flow 
of new, inexperienced teachers was substantially 
reduced; the average experience level of teachers 
grew substantially from 1970 to 1990 (Grissmer and 
Kirby, 1997), and, by 1990, a significant number of 
teachers had 20 or more years of experience. We as- 
sume here that teachers as a group 
are most productive between 5 and 
20 years of experience. Figure 8 
shows the average changes in the 
percentage of teachers with 5-20 
years of experience who would be 
teaching an entering cohort of age 
9, 13, or 17 children. 14 The percent- 
age of teachers in this experience 
range grew considerably for cohorts 
entering from the 1960s to the mid- 
1980s, then fell with the growth of 
teachers with 20 or more years of ex- 
perience. 

Along with this gain in experi- 
ence came more teachers with 
master's degrees. Figure 9 shows that the percent- 
age of teachers with master's degrees experienced 
by an entering school cohort rose for all age groups. 
The education level of teachers grew steadily for en- 
tering school cohorts from the 1960s to the 1990s — 
although the period of fastest growth was from the 
1960s to the mid-1980s. The size of classes was also 



reduced substantially during this period. Figure 10 
shows the average pupil /teacher ratio for entering 
school cohorts up to age 9, 13, and 17. The pupil/ 
teacher ratio — a measure of class size — also fell dra- 
matically for cohorts entering school in the 1960s 
and 1970s, but slowed considerably in the 1980s. 
These changes occurred at both elementary and sec- 
ondary levels although the timing was somewhat 
different. Part of the reason class sizes fell was also 
related to the baby boom. As enrollments dropped 
in the 1970s, rather than terminate teachers, the 
opportunity was used to reduce class size. 

The empirical evidence on the effects of these 
three variables more greatly impacts class size than 
teacher experience or education for two reasons. 
First experimental evidence exists for the effects of 
class size. Second, the accuracy of all measurements 
using non-experimental data is now being ques- 
tioned. 

A large, multi-district study in 
Tennessee that randomly assigned 
students to classes of approxi- 
mately 14 students instead of ap- 
proximately 22, found that reduc- 
ing class size between kindergarten 
and third grade had significant ef- 
fects on achievement, and even 
greater effects for blacks (Krueger, 
1997; Mosteller, 1994). The effects 
averaged about 0.20 standard de- 
viations for whites and 0.30 stan- 
dard deviations for blacks, with 
equal effects in reading and math- 
ematics. Following the experiment, 
Tennessee also cut class sizes in 13 school districts 
with the lowest family income. Comparisons with 
other districts and test score changes within these 
districts showed gains of 0.35 to 0.5 standard devia- 
tions (see Mosteller, 1994). 15 The Tennessee data 
suggest that disadvantaged students may experi- 
ence the most gains from class size reductions. 



Nationwide, three 
significant changes 
took place in schools 
during this period — 
lower class size, and 
better educated and 
more experienced 
teachers. 



14 We somewhat arbitrarily chose 5-20 years of experience as the period of peak productivity for teachers assuming an early 
learning curve for teachers and some average "burnout" effect after 20 years of service. The data here is the average percentage 
of teachers with 5-20 years of experience during the schooling experience of each age group from school entry. Thus 9-year- 
old students who entered school in 1970 would be estimated by taking the average percentage of teachers with 5-20 years of 
service between 1970 and 1973. For 13-year-old students entering in 1970, the average would be from 1970 to 1977. 

15 Other research on school districts in Alabama shows similar overall effects when using models without prior year test score 
controls (Ferguson and Ladd, 1996). 
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Figure 8. — Average percentage of teachers with between 5-20 years of teaching experience 
in years of attendance 
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SOURCE: U.S. Department of Education, National Center for Education Statistics, National Assessment of Educational 
Progress (NAEP). 




Figure 9. — Average percentage of teachers with a master’s degree or higher for years of 
attendance 
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SOURCE: U.S. Department of Education, National Center for Education Statistics, National Assessment of Educational 
Progress (NAEP). 
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Figure 10. — Average cohort pupil/teacher ratio for years of attendance, by year of school 
entry 
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SOURCE: U.S. Department of Education, National Center for Education Statistics, National Assessment of Educational 
Progress (NAEP). 



It is likely that had a similar experiment been 
done using a national sample of K-3 students in the 
1970s, even larger differences between black and 
white test scores would have been measured. This 
is because white students in Tennessee are poorer 
than average white students nationwide and black 
students in Tennessee probably were less disadvan- 
taged in 1990 than blacks nationwide in the 1960s 
and 1970s. National reductions in class size in the 
1960s and 1970s were approximately the same as 
the Tennessee experiment, thus effects ranging from 
0.3 to 0.6 might be expected for black students and 
0.00 to 0.20 for white students from the national class 
size reductions. Thus, reductions in class size may 
be a key factor in explaining large black gains and 
why black scores may have risen much more than 
white scores. 

The Tennessee experimental evidence also 
leaves a number of questions unanswered. First, 
we do not know much about either the long-term 
effects of smaller elementary school classes or the 



0 




cumulative effects of smaller classes from kinder- 
garten through twelfth grade. In the Tennessee ex- 
periment, students were returned to large classes 
after third grade. By seventh grade the standard- 
ized benefits of smaller classes were only half as 
large as they had been at the end of third grade, 
and the benefits to black students were no larger 
than the benefits to whites (Mosteller, 1995). We do 
not know what would have happened if classes had 
remained small until students finished school. Sec- 
ond, the Tennessee measurements may only repre- 
sent a short-term effect since only a single cohort 
was measured. Teachers and policymakers may be 
able to adapt their teaching and policies to take bet- 
ter advantage of smaller class sizes in the longer 
term. 

Another problem with the hypothesis that class 
size reductions raised test scores is that class size 
fell in the 1960s as well as the 1970s. If smaller classes 
had conferred long-term benefits, 17-year-olds who 
entered school in 1968 should have outscored those 
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who entered in 1960. This was not the case outside 
the South. 16 Further research is required to test the 
class size hypothesis as a strong contributing factor 
to black gains. 

One important side effect of the Tennessee ex- 
periment is that it raised new doubts about non-ex- 
periment al studies conducted in a "production func- 
tion" framework. 17 These studies typically try to 
control standardized scores at lime One and then 
discern whether a resource like smaller classes af- 
fects gains between Time One and Time Two. In 
Tennessee, however, smaller classes exerted their 
entire effect on standardized scores in the first year. 
Thereafter, smaller classes simply served to sustain 
the initial standardized gains. Thus, the estimated 
effect on smaller classes in grades one through three 
would have been zero. 

Current empirical measurements of the effects 
of teacher education and a master's degree show 
no consistently strong effects — but better specified 
models might change these results. It remains to be 
seen whether more teacher education and more ex- 
perience raised achievement scores awaits stronger 
empirical evidence and determination of the cur- 
rent flaws in specifying estimation models. 

Discussion 

Recent research is undermining several of the 
assumptions and empirical evidence underlying the 
"money doesn't matter" conclusion. The validity 
of the empirical studies reviewed to arrive at this 



conclusion is being questioned due to the use of 
model specification which would not reproduce the 
experimental class size results. It is possible that 
fundamental flaws are present in nearly all non-ex- 
perimental studies of the effects of school resources 
due to the methods of model specification. Second, 
experimental data which avoids the assumptions 
needed in models with non-experimental data in- 
dicates that reductions in class size — a key school 
resource parameter — have significant effects with 
larger effects for minority students. Third, NAEP 
data — which had previously been used together 
with the large perceived increases in school re- 
sources to support the "money doesn't matter" ar- 
gument — now seems more supportive of a differ- 
ent conclusion. This evidence seems to support the 
thesis that money directed at minority and disad- 
vantaged students brings higher achievement 
scores, but money directed toward more 
advantaged students may have much smaller or 
negligible effect. Moreover, the additional money 
available in the 1960s to 1990s was much less than 
previous estimates. Instead of doubling in real 
terms, the real increases directed toward achieve- 
ment of regular students was closer to 30 percent 
during this period. These additional resources were 
also disproportionately directed toward minority 
and lower income students. Thus, a more consis- 
tent story is emerging from the empirical data which 
is more supportive of the thesis that additional 
money matters greatly for minority and disadvan- 
taged students, but much less or little for 
advantaged students. 



16 Class size effects may depend on how teachers change their behavior when they have smaller classes (Mumane, 1996). The 
effects of smaller classes may take several years to appear, because teachers and students need time to adjust their behavior to 
smaller classes. However, effects appeared immediately in Tennessee and increased very little as a result of additional years 
in small classes. The Tennessee experiment did not address the important question of whether effects would grow at each 
grade level as teachers experienced smaller class sizes over many years. 

17 The Tennessee data shows that gains from smaller classes appear immediately and grew by small amounts over the first three 
grades (Krueger, 1997). This implies that production functions that utilize previous year's scores would not measure the class 
size effects evident in Tennessee. Two studies that have used some of the best data at the state level and had prior year's test 
scores as controls were considered to be among the strongest studies (Ferguson, 1991 and Ferguson and Ladd, 1996). However, 
these specifications now appear to provide biased results. In the latter study, results are also presented of cross-sectional 
estimates without controls, and these results may now have more credibility than those with prior year controls. Generally, 
the results of the model without prior score controls show stronger effects for most variables. 
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Introduction 

School reform proposals aimed at improving 
student achievement range from developing new 
standards and curriculum to personalizing student- 
teacher relationships and increasing time for teach- 
er planning and learning. Research over the last 30 
years has indicated that student achievement is 
higher, dropout rates are lower, and student affect 
and behavior are better in schools where students 
are well-known to their teachers (for reviews see 
Darling-Hammond, 1997; Braddock and 
McPartland, 1993; Lee, Bryk, and Smith, 1993). 
These findings have emerged from studies of school 
size, class size, and school organization. In terms of 
structural variables other than size, two critical con- 
ditions concern reduced tracking or curriculum dif- 
ferentiation and greater opportunity for students to 
work intensely with a smaller number of teachers 
over longer periods of time (Gottfredson and Daiger, 
1979; Lee, Bryk, and Smith, 1993; Lee and Smith, 
1994, 1995). Other studies have emerged that docu- 
ment problems with "pullout" models of instruc- 
tion that serve students' needs by assigning them 
for brief periods of time to a variety of specialists, 
each of whom is supposed to treat one "problem" 
independent of the others (Commission on Chapter 
1, 1992; Soo Hoo, 1990). 

Meanwhile, reformers continually note that 
teachers need substantial time together in order to 
create new practices and engage in shared problem 




solving (see, for example, Darling-Hammond, 1997; 
Little, 1993; Sizer, 1992). Studies of teacher devel- 
opment in other countries, notably China and Ja- 
pan, point out how teachers become more proficient 
from continually working on curriculum, demon- 
stration lessons, and assessments together (National 
Commission on Teaching and America's Future, 
1996; Stigler and Stevenson, 1991). The new cur- 
riculum reforms in the United States require sub- 
stantial teacher learning which is, ideally, content- 
based and collaboratively pursued in tight connec- 
tion to teachers' ongoing classroom work (Ball and 
Cohen, in press). In an era of belt-tightening and 
rising student enrollments, finding the resources for 
these reforms will require schools to reexamine the 
use of every dollar. 

Much publicity has surrounded efforts to redi- 
rect dollars from administrative functions back to 
the classrooms. New Jersey's governor has even re- 
fused to reimburse district administrative costs 
above a fixed percentage that is lower than many 
districts currently spend. However, little attention 
has been given to rethinking the use of existing 
instructional resources, especially teachers — 
schools' most important and expensive resource. 

On the surface it would seem that schools should 
have the needed resources to create more individual 
time for students and increase professional time for 
teachers. From 1960 to 1992, the number of pupils 
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per teacher dropped from 26 to 17.6. Furthermore, 
schools employ 1 adult for every 9 students (NCES, 
1994). Despite these generous ratios, class sizes 
exceed 25 for most students most of time, student 
loads are well over 100 for most teachers in second- 
ary schools, and teacher planning time is both mini- 
mal and conducted in isolation from other teachers. 
The basic structure of schools has remained essen- 
tially the same across districts and over time, with 
new resources added largely around the regular 
classroom, rather than into it. And despite recent 
calls for "restructuring," a number of surveys sug- 
gest that public schools rarely engage in major real- 
locations of resources (Rettig and Canady, 1993). 

How might existing resources be used to per- 
sonalize student-teacher relationships and provide 
teachers time to plan and work more closely to- 
gether? Can these goals be met in ways that also 
support student learning for all stu- 
dents, including those with special 
needs? With so few examples of pub- 
lic schools organized in different 
ways, there is little empirical re- 
search on these questions. Further- 
more, researchers have not had com- 
mon ways to describe and measure 
different models. This study aims to 
begin to fill these gaps by describ- 
ing in detail how five schools have 
reallocated resources while support- 
ing high levels of student learning. 

The sample is too small and the 
schools too unique to claim a causal 
connection between their organiza- 
tional designs and their students' 
successes. 1 However, the schools do demonstrate 
that it is possible to support student achievement at 
extraordinarily high levels in contexts that manage 
instructional resources to maximize individual at- 
tention for students and learning time for teachers. 

We sought to identify the elements of organiza- 
tion and resource use that seem most important in 
these contexts and to quantify objectively the ways 
in which these schools use resources differently than 



traditional schools. We also aimed to learn from 
these examples about the conditions that facilitate 
or hinder resource restructuring. Although the 
schools look very different from one another, they 
share six principles of resource allocation imple- 
mented in different ways depending on their spe- 
cific educational goals and strategies. In what fol- 
lows, we describe their approaches, present a frame- 
work for examining resource allocation and use, and 
develop a methodology that may be used to mea- 
sure the extent to which schools use their resources 
in focused ways to support teaching and learning. 

Opportunities for Fundamental 
Reallocation of Resources 

It is unlikely that schools can find ways to cre- 
ate more individual time for students or more shared 
planning time for teachers without prohibitively 
raising costs, unless they rethink 
the existing reorganization of re- 
sources. In this article, we focus 
primarily on the assignment and 
use of teaching staff because it is 
the most sizable and the most 
underexplored area for potential 
resource reallocation. As we noted 
above, pupil teacher ratios have 
dropped substantially and real 
expenditures have nearly doubled 
over the last 30 years. A recent 
analysis of staffing and spending 
patterns from 1967 to 1991 in nine 
very different districts across the 
country found that few of the new 
teaching staff were deployed to re- 
duce class sizes for regular education students; most 
went to provide small classes to the growing num- 
ber of students in special programs and to add a 
modest amount of time for teachers free from in- 
struction during the school day (Miles, 1997a, 1997b; 
Rothstein and Miles, 1995). 

As instructional staff have increased, the pro- 
portion of teachers has declined. Since 1950, the 
proportion of school staff who are classified as teach- 



... we focus prima- 
rily on the assign- 
ment and use of 
teaching staff be- 
cause it is the most 
sizable and the most 
underexplored area 
for potential resource 
reallocation. 



1 




Some large-scale quantitative studies do, however, suggest that student achievement is correlated with school designs that 
enable teachers to spend more extended time with smaller numbers of students and that allow teachers to make instructional 
decisions in teams (Gottfredson and Daiger, 1979; Lee, Bryk, and Smith, 1993; Lee and Smith, 1995; Darling-Hammond, 1997). 
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ers has dropped from 70 percent to 53 percent, not 
all of whom are classroom teachers. Overall, about 
43 percent of staff are regularly engaged in class- 
room teaching (National Commission on Teaching 
and America's Future [NCTAF], 1996). By contrast, 
60 percent to 80 percent of education staff in most 
European countries are classroom teachers, enabling 
much greater time for collaborative planning and 
professional development (OECD, 1995). 

An analysis of the allocation of teaching re- 
sources in Boston public schools identified six rea- 
sons for the gap between potential and reality in 
U.S. schools (Miles, 1995). Many of these organiza- 
tional practices are so widespread that Tyack (1994) 
describes them as the "grammar of schooling" while 
Sarason (1982) calls them "school regularities." 
These practices form the basis of our conceptual 
framework for understanding and quantifying the 
use of teaching resources in both 
traditional and nontraditional 
schools. Below we briefly describe 
the impact of each on the use of 
teaching resources. 

1. Specialized programs con- 
ducted as add-ons. In most 
school districts, a large portion 
of teachers work outside the 
regular classroom with special 
populations of students in cat- 
egorical programs such as spe- 
cial education, Title 1 compensa- 
tory education, bilingual educa- 
tion, remedial education or 
gifted education. Such programs 
for special student populations absorbed 58 per- 
cent of the new dollars devoted to education from 
1967 to 1991 (Rothstein and Miles, 1995). Most of 
these programs operate under federal, state, or 
district regulations and sometimes collective bar- 
gaining agreements that prescribe how teachers 
may be used and students may be grouped. Most 
districts operate these programs using a 
"pull-out" model in which students leave the 
regular classroom for portions of the day for in- 



struction in small groups. In 1991, in Boston, 
teachers in specialized programs working outside 
the regular classroom represented over 40 percent 
of the teaching force. 2 Not only are pull-out strat- 
egies extremely costly, they also segregate stu- 
dents in sometimes stigmatizing ways and pro- 
vide services that are often ineffective due to their 
fragmentation and lack of connection to the 
student's classroom experience. Schools rethink- 
ing resources will consider how remedial, special 
education. Title 1 and bilingual education re- 
sources might work together in an integrated plan 
to benefit all students in "regular education" set- 
tings. 

2. Isolated instruction-free time for teachers. Cur- 
rently, most school districts provide teachers with 
short periods of time free from instruction while 
using other classroom teachers to give instruction 
at these times. At the elementary 
level, teachers often have a 45 
minute duty-free period four or five 
times a week which is covered by 
specialists in art, music, or physical 
education. In 1991, this represented 
9 percent of Boston's elementary 
teaching resources. At the second- 
ary level, a teacher might teach 5 of 
7 instructional periods, using one for 
planning and the other for lunch or 
duties (monitoring hallways, the 
lunchroom, or study hall). Other 
teachers cover instruction during 
the portion of the student's instruc- 
tional day in which the teacher is not 
teaching. Although secondary 
teachers have more preparation time than elemen- 
tary teachers (about 4 or 5 hours per week as op- 
posed to 3 hours), the short blocks of individual 
non-instructional time do not allow much sub- 
stantive planning or collaboration. These activi- 
ties would require longer blocks of uninterrupted 
time that is coordinated with other teachers. 
Schools rethinking their use of resources will con- 
sider ways of creating longer periods of time for 
teachers to plan and develop curriculum together. 



An analysis of the 
allocation of teaching 
resources in Boston 
public schools identi- 
fied six reasons for 
the gap between 
potential and reality 
in U.S. schools . . . 



2 A similar analysis quantifies the impact of these practices in three other districts: Fall River, Massachusetts, Middletown, N. Y., 
and East Baton Rouge, Louisiana. See Miles 1997a. 
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3. Formula-driven student assignment. In search 
of efficiency and standardization, American 
schooling processes have been broken into seg- 
ments, like grades and tracks, through which stu- 
dents are expected to move at an even, uniform 
rate. Districts use formulas to assign students to 
classrooms in a regularized fashion by age, sub- 
ject, and program. These practices are costly, be- 
cause the uneven allocation of teachers over 
grades, small programs and undersubscribed 
subjects contributes to unplanned differences in 
class size that do not reflect educational strate- 
gies. These practices may also preclude ap- 
proaches such as multi-age or multi-ability group- 
ing that can be more effective when teachers are 
prepared to engage in them (Anderson and 
Pa van, 1993; Slavin, 1990). 

Using formulas to allocate students to classrooms 
by age can create huge variation 
in class sizes. For example, Bos- 
ton public schools cap elementary 
class sizes at 28. When the 29th 
student enters a school with only 
one class in that grade, a new 
teacher must be added. Thus, the 
class size average falls dramati- 
cally from 28 to 14.5. In 1991, 
regular elementary class sizes in 
Boston's 645 elementary class- 
rooms varied from 15 to 31. Class 
sizes could vary by 8 or 9 students 
from one grade to another in the 
same school. The more separate 
programs and subjects a school 
has, and the more constrained by 
age grading or tracking practices, the more often 
this kind of unplanned variation in allocation of 
resources occurs. Schools looking to better match 
resources to student needs will consider assigning 
students to groups based on educational strate- 
gies rather than standard classifications. Schools 
may also strategically vary group sizes and the 
daily schedule for particular kinds of lessons or 
skill work. 

4. Fragmented high school schedules and curricu- 
lum. The problems of age grading are com- 
pounded in high schools by tracking, schedules 
with large numbers of short periods (typically 45 
to 50 minutes), and teacher and subject special- 



ization. One unfortunate effect of this fragmented 
approach to schooling is its impact on teachers' 
pupil loads. With five classes of 25 to 30 students 
each, most secondary teachers work with 125 to 
150 students per day. Reducing teaching loads 
without dramatically increasing costs can be ac- 
complished by reducing the number of different 
groups teachers teach, either by combining tra- 
ditionally separate subject areas, lengthening the 
duration of classes, or having smaller groups of 
students work intensively with teachers in smaller 
numbers of subjects, as in universities (Carroll, 
1994). 

5. Large high schools. Secondary schools average 
nearly twice the size of elementary schools (NCES, 
1994), and frequently exceed 2,000 students. The 
conventional justification is that larger enroll- 
ments create economies of scale by distributing 

administrative and operating costs 
and offering a more diverse cur- 
riculum cost effectively. However, 
existing research suggests that 
high schools have created more in- 
ternal specialization and depart- 
mentalization than can be scientifi- 
cally justified. Most studies have 
found that, all else equal, above 
about 800 students, larger schools 
produce lower student achieve- 
ment and lead to increased alien- 
ation, higher dropout rates, and 
larger numbers of administrative 
staff, thereby deflecting resources 
from classroom instruction. 
Furthermore, beyond about 400 
students, there are few if any gains in curriculum 
quality (for reviews, see Lee, Bryk and Smith, 
1993; Darling-Hammond, 1997). 

6. Inflexible teacher work day and job definition. 
In Boston, like many other cities, the teacher con- 
tract specifies the required hours of work, start- 
ing and ending times, and how teachers can be 
assigned over the day. This makes it difficult to 
stagger starting times in order to make the best 
use of staff time or to meet student needs more 
fully. The contract prevented one high school 
from changing the work hours of its guidance staff 
to start and end later in the day so that students' 
meetings with counselors would not conflict with 
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their courses. The contract also requires that plan- 
ning time be spread out over the day; this makes 
it difficult to combine instruction-free periods to 
create longer, shared blocks of time. 

The use of part-time teachers is forbidden if they 
substitute for potential full-time positions. While 
intended to discourage management from substi- 
tuting lower cost and potentially lower quality 
teachers for dedicated full-time ones, this regula- 
tion limits affordable solutions to coverage for teach- 
ers' noninstructional time. One way to create com- 
mon planning time for groups of teachers, for ex- 
ample, is to schedule coverage for them by hiring a 
larger number of part-time specialist teachers dur- 
ing this time. Finally, rigid definitions of the work 
day exclude individuals who would be willing to 
work at times beyond the typical school hours. 
Schools looking to better match resources to student 
and staff needs may want to con- 
sider the use of highly-skilled staff 
in part-time positions and on var- 
ied job schedules. 



not free enough resources to significantly change the 
possibilities for student or teacher learning. 

Study Methods and Analytic 
Framework 

Sample 

To create a sample of schools that could offer 
insight into the possibilities and challenges involved 
in rethinking the allocation of instructional re- 
sources, the study sought elementary and second- 
ary schools that: 

1. Had engaged in a significant rethinking of re- 
sources touching on at least four of the resource 
principles listed above. 

2. Used no significant extra resources above the 
school system average per pupil except startup 

or training grants. 



The strategies observed in 
schools that have sought to change 
these conditions can be described 
by six principles of resource reallo- 
cation: 

1. Reduction of specialized pro- 
grams to provide more indi- 
vidual time for all in heteroge- 
neous groups, 

2. More flexible student grouping 
by school professionals, 

3. Structures that create more personalized environ- 
ments, 

4. Longer and varied blocks of instructional time, 

5. More common planning time for staff, and 

6. Creative definition of staff roles and work sched- 
ules. 

These kinds of changes can create opportuni- 
ties for realigning teaching resources to school goals 
(Miles, 1995). Our observations in schools suggest, 
however, that altering any one practice alone may 



Our observations in 
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however, that altering 
any one practice 
alone may not free 
enough resources to 
significantly change 
the possibilities for 
student or teacher 
learning. 



3. Served a diverse student popu- 
lation in terms of income, ability, 
language background, and spe- 
cial needs. 

4. Had used a new model of orga- 
nization for at least 2 years. 

5. Had evidence of strong and im- 
proving student performance. 



c 



BEST COPY AVAILABLE 



To find such schools, we sur- 
veyed experts involved in reform 

networks nationwide. The five 

schools we selected represent differ- 
ent educational strategies. Three of 
the schools started from scratch with a new design 
in mind, and had considerable flexibility in hiring 
staff and creating programs. The other two schools 
restructured existing programs and staff. We se- 
lected three elementary schools and two secondary 
schools: 

Quebec Heights Elementary School, Cincin- 
nati, Ohio had, at the time of the study, 500 stu- 
dents in grades K-6, with 15 percent classified as 
having special education needs and 70 percent 
eligible for Title 1. Quebec Heights eliminated 
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age- and program-based instructional grouping 
and put students in smaller, multi-aged, hetero- 
geneous groups that remain together for 3 years. 
The school created reading groups of 8 or smaller 
each day. Teachers have common planning time 
each day and pursue professional development 
in the school's priority areas during the school 
day. Cohort analysis of student performance data 
shows that both special education and regular 
education students have improved faster than the 
Cincinnati average. 

The Douglass Elementary School, Memphis, 
Tennessee had 475 students with 17 percent classi- 
fied as requiring special education and 88 percent 
qualifying for Title 1 support. At the time of the 
study, the school was in its third year of implement- 
ing the "Success for All" program which restruc- 
tured school resources to allow 90 minutes a day of 
reading plus daily individual tutoring for first and 
second graders who did not meet grade level stan- 
dards. In addition, Douglass was working to inte- 
grate its special education students and teachers 
fully into the regular classroom. After the second 
year of implementing the program, the percent of 
second-graders (the only students with two years 
of the new model) above the median in language 
arts climbed from 17 to 59 percent. In addition, the 
school's evaluation of special education integration 
showed these students continuing to progress aca- 
demically and socially. 

The Mary C. Lyons Model Elementary School, 
Boston Massachusetts had 90 students in grades K— 
5, 60 of whom were classified as regular education 
and 30 of whom had severe emotional disturbances 
that previously required placement in highly restric- 
tive settings. Over 80 percent of students qualified 
for Title 1. The school fully integrated all special 
education students into regular classes of 15 or 
smaller, each with a teacher and instructional assis- 
tant. Mary C. Lyons created extended school hours 
lasting from 7:00 am to 5:15 p.m. using outside con- 
tractors to provide instruction. The variety of dif- 
ferent staffing arrangements included parapro- 
fessionals, teacher interns, part-time workers, and 
staggered shifts. The school was 1 of 15 (out of 115) 
Boston schools to be over-chosen by every race for 
both special education and regular education slots 
$ for 3 years in a row. Achievement scores for both 
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special education and regular education students 
improved faster than the Boston average, and 100 
percent of the students were reading on grade level. 

Central Park East Secondary School (CPESS), 
New York, New York served 450 students in grades 
7 through 12, about 25 percent of whom qualified 
for special education and 60 percent for free or re- 
duced-price lunch. All students were integrated into 
heterogeneous classrooms. The school restructured 
the typical secondary schedule to create two hour 
blocks of instructional time in both the humanities 
and math /science. Teachers had more than 7 hours 
each week of common planning time in addition to 
their daily individual preparation period. To reduce 
academic group sizes, CPESS allocates nearly all of 
its positions for teaching, rather than hiring guid- 
ance counselors and various administrative staff. 
All professional staff members serve as advisors to 
about 12 to 15 students each year. The school hires 
some teachers on a part-time consulting basis for 
electives like foreign languages. CPESS has been 
nationally heralded for its consistently exceptional 
outcomes: each year since its first graduating class, 
more than 90 percent of its students have gradu- 
ated and more than 90 percent have been accepted 
to college. 

International High, New York, New York is an 
alternative school of 475 recent immigrant students 
in grades 9 through 12. Only students who have 
been in the United States less than 4 years and who 
score below the 20th percentile on an English lan- 
guage proficiency exam are admitted. At the time 
of the study, over 75 percent of the students were 
eligible for free or reduced-price lunch. International 
High integrates all state-mandated subject matter 
in an interdisciplinary curriculum taught in 
multi-aged heterogeneous groups. Teachers work 
with no more than 75 students a term and spend 70 
minutes or more with them each day. The teachers 
have nearly six hours each week of common plan- 
ning and professional development time. All staff 
members lead a small advisory group which meets 
weekly to discuss issues of personal, academic, and 
social growth. Despite its "high risk" population, 
the school's dropout rate was less than 1 percent in 
1993-94 as compared with 30 percent citywide, and 
both the graduation rate and college acceptance 
rates exceeded 95 percent. For more than a decade. 
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these rates have exceeded 90 percent annually (IHS, 
1995; Darling-Hammond, Ancess, and Falk, 1995). 

Figure 1 summarizes the resource allocation 
strategies used in the five sample schools. It shows 
that each school implemented many strategies for 
allocating teachers and teaching time to address stu- 
dent needs and create planning time. Only the three 
alternative schools — Mary C. Lyons, CPESS, and 
International High — created new teaching roles by 
contracting with other providers for teaching or by 
restructuring some teaching positions. 

Data Collection and Analytic 
Framework 

To understand the resource allocation practices 
in each of the five schools, we collected informa- 
tion about school expenditures, staffing, and student 
scheduling. We collected comparable 
data for traditional schools, along 
with district level budget and staff- 
ing information. We conducted in- 
terviews with administrators and 
teachers and examined available 
written material at each school to un- 
derstand the school's organization 
and its link to educational purposes 
and outcomes. 



volved can be seen through one example: the at- 
tempt to measure the impact of the principle "re- 
duction of specialized programs to create more in- 
dividual time for all." In a traditional school, regu- 
lar class size provides a useful gauge of how much 
access to individual attention a student might have. 
But, regular class size does not reflect the student's 
experience in some innovative schools because it 
does not describe the way these schools organize 
by subject and over the course of the day. For ex- 
ample, the regular class sizes of 24 at Quebec Heights 
school do not reflect the fact that all students spend 
90 minutes a day in groups of 8 for reading. In or- 
der to capture the additional individual time for all 
students, a measure of average instructional group 
size, rather than regular class size, is used. This mea- 
sure relies on greater descriptive knowledge of a 
school, but it more accurately reflects student expe- 



rience. 



. . . this study cre- 
ated measures which 
allow comparison of 
resource allocation 
patterns between 
these school models 



and traditional 
schools. 



In addition to describing the 
strategies each school used, this 
study created measures which allow 
comparison of resource allocation 
patterns between these school mod- 
els and traditional schools. The mea- 
sures were developed by taking each resource allo- 
cation principle, hypothesizing the quantifiable im- 
pact it might have on resources, and then testing 
this impact by using indices that are: (1) descriptive 
of practices in both traditional and nontraditional 
schools, (2) easy to understand, and (3) replicable. 

Creating measures that accurately portray prac- 
tices in the fluidly organized sample schools yet al- 
low comparison to traditional schools creates a ten- 
sion between the use of measures that are easily 
understood and calculated and those that can pro- 
vide meaningful description. The subtleties in- 



Figure 2 summarizes the mea- 
sures used for each resource allo- 
cation principle. The first prin- 
ciple, "reduction of specialized 
programs to create more indi- 
vidual time for all in heteroge- 
neous instructional groups" 
should lead to smaller average in- 
structional groups for all regular 
education students and more even 
distribution of resources between 
regular and special program stu- 
dents. Three measures helped as- 
sess the extent to which innovative 
schools differed from traditional 
schools here. 



1. Students per teacher: This number includes all 
teachers and students in the school from all pro- 
grams. Our sample elementary schools had simi- 
lar numbers of students per teacher. However, a 
school can reduce its functional student to teacher 
ratio by converting typical non-teaching slots to 
teaching roles as CPESS did. The index of students 
per teacher indicates only the opportunity to cre- 
ate small, flexible instructional groups. It does 
not reflect the actual size of the groups in which 
most students spend time. 
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Figure 1. — Resource reallocation strategies used by sample sites 
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2. More flexible student 
grouping 
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X 
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3. Structures to create more 
personal environments 


X 
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X 


X 
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4. Longer and varied blocks of 
instructional time 


X 


X 


X 


X 
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5. More common planning time 


X 


X 


X 


X 


X 


6. Creative definitions of staffing 
roles and work day 


X 






X 


X 



X=Sample school implements strategy. 
SOURCE: Unpublished tabulations. 



2. Weight average group size: This measure calcu- 
lates the weight average size of the instructional 
group which a typical student experiences over 
the day for academic subjects. It incorporates 
the time spent in different group sizes over the 
day. For example, if students in a classroom of 
24 spent 90 minutes a day (25 percent of their 
school day not including lunch) in reading 
groups of 8, then the weight average group size 
would be 20 (.75 times 24 plus .25 times 8). In a 
traditional school, the average group size and the 
regular class size would be the same. This mea- 
sure may offer a clearer sense of how much ac- 
cess to individual attention most students in the 
school have. 

3. Percent of teachers in regular education instruc- 
tional groups: This figure divides the number 
of teachers who work with regular education stu- 
dents (including classroom teachers, subject 
specialists and other teachers who work all day 
instructing groups that include regular educa- 
tion students) by the total number of teachers in 
the school. The figure gives a sense of the extent 
to which a school has concentrated its resources 
on core classroom functions as opposed to pull- 
out programs of various kinds. 

The second principle, more flexible student 

grouping, should allow educators to create instruc- 



tional groupings that more closely match instruc- 
tional needs. Formulas that mandate the size of 
groups and classrooms can create situations where 
group sizes vary for no educational reason. When 
teachers can create their own groups using criteria 
linked to educational strategies, they can reduce 
these unplanned variations and create a strategy 
which maximizes the use of limited resources. The 
percent of regular education students in targeted group 
sizes represents the extent to which a school has 
minimized random variation in class size. In schools 
where no group size target existed other than the 
contractually defined class size maximums, we 
measured how many students were in classes which 
were within 5 percent of the average size. More 
flexible student grouping also allows teachers to 
create smaller groups for target subject areas. The 
average size of instructional groups in focus area mea- 
sures how schools focus resources to create more 
individualized attention in some subjects where 
they do so. If some regular education students 
spend time in much smaller instructional groups, 
this would be reflected in the average by calculat- 
ing the percent of students receiving such support. 

Four aspects of the third principle, structures 
to support more personal relationships between 
teachers and students, lend themselves to measure- 
ment. First, a primary indicator of a teacher's op- 
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Figure 2. — Measuring resource allocation patterns (staff allocation) 
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for academic subjects 
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SOURCE: Unpublished tabulations. 



portunity to build relationships with each student 
is the academic teacher's student load. A second in- 
dicator of a school's effort to maximize personal re- 
lationships might be the percent of professionals who 
serve as instructors or advisors to regularly scheduled 
groups of students in an ongoing fashion. Thus, an 
assistant principal who worked with occasional dis- 
cipline problems or a guidance counselor meeting 
once with each of 200 students to ensure compli- 
ance with graduation requirements would not be 
included. Although these singular contacts with 
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students can be important, they do not aim to build 
long term, personal relationships between school 
professionals and students. The average size of teacher 
and student teams or clusters provides a third mea- 
sure of the opportunity to create a more personal 
educational environment. For this measure, 
student-teacher teams had to be self-managing and 
self-contained. This means that virtually all in- 
struction occurs within the cluster and that the clus- 
ter has primary responsibility for curriculum, group- 
ing, discipline, and evaluation of its students. A fi- 
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nal strategy schools might use to create personal 
relationships would be to keep teachers and stu- 
dents together for longer than the typical year. Thus, 
we include a measure of the number of years teachers 
and students stay together . 

The extent to which sample schools created 
longer blocks of instructional time is measured by the 
average scheduled length of instructional period for 
academic subjects in secondary schools. In some of 
the schools studied, teachers vary the length of in- 
struction from the schedule to suit a particular les- 
son. These variations were not calculated here. 

Finally, two measures are used to understand 
how sample schools create more useful common 
planning time for teachers. The number of minutes of 
common planning time is defined as the amount of 
time each week that is shared with other teachers 
and spent on collaborative plan- 
ning regarding curriculum, stu- 
dents, or school practices. A second 
indicator of the usefulness of the 
planning time is the length of the 
longest planning period . For some 
kinds of planning and develop- 
ment, teachers need time periods 
longer than the typical 40 to 50 min- 
utes. 



other support staff were the same as at Mary C. 
Lyons. 

International High in New York City serves a 
unique population of limited English speaking stu- 
dents. Traditional schools serve such students 
through bilingual programs and ESL courses offered 
separately from the rest of the high school curricu- 
lum, but do not typically have 100 percent of their 
population requiring such services. To create a com- 
parison to International High, we used the New York 
City staffing allocation formula to determine the 
number of teachers the school would have received 
for these special needs students; we assumed the 
additional resources would be used to fund sepa- 
rate bilingual or ESL classes. Although this gener- 
ous assumption about universal ESL services to lim- 
ited English proficient students does not hold true 
in many of New York's traditional schools, it offers 
a best case scenario for the alloca- 
tion of resources in a traditional 
model. 



Adjusted for stu- 
dent mix, the 
schools in this 
sample used no 
greater resources 
than traditional 



schools on an 



ongoing basis. 
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Each innovative school is com- 
pared with a typical school in the 
same district with a similar student 
population. Meaningful compari- 
sons must include an adjustment 
for the mix of students eligible for special services 
because schools typically receive additional re- 
sources to serve them. Adjusted for student mix, 
the schools in this sample used no greater resources 
than traditional schools on an ongoing basis. In two 
cases, no "traditional" school existed in the district 
which served the same mix of students as our 
sample sites. Mary C. Lyons Model Elementary 
School in Boston draws a large percentage of its 
population from special education students typi- 
cally served by private schools. In this case, a hy- 
pothetical comparison was created, based on the as- 
sumption that these students were served in sepa- 
rate, self-contained classrooms of 4 each, the small- 
est existing class size, and that social services and 



These calculations are intended 
to provoke discussion and to pro- 
vide an objective way of comparing 
schools to each other. Obviously, 
other factors which are not incorpo- 
rated in these measures contribute 
to the opportunity for individual at- 
tention and shared teacher time. 
For example, a teacher in a class of 
24 may use sophisticated grouping 
practices that allow her or him to 
provide targeted individual or 
small group instruction to students 
throughout the day. These grouping strategies are 
not incorporated into this measurement scheme 
unless the entire school uses the strategy. The exist- 
ence of planning and development time does not 
guarantee that it is used to improve teaching qual- 
ity. Further, many schools find common planning 
time for teachers outside the school day on a volun- 
teer basis. Thus, these measures are intended to be 
used in conjunction with a descriptive understand- 
ing of the way a school has organized itself to match 
resources to student needs and to provide opportu- 
nity for teacher growth. 
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Study Findings 

We discuss our findings concerning elementary 
and secondary schools separately because they be- 
gin with such different organizational structures. 
With their relatively small teaching loads and 
self-contained multi-subject classrooms, elementary 
schools already allow more flexible, individual in- 
struction. But their simple structures, with limited 
teacher time free from instruction, do not offer the 
same opportunities for freeing time and resources 
as secondary schools. Because of these simpler daily 
schedules, reducing the use of pull-out programs 
for special education, language and Title 1 instruc- 
tion becomes a primary lever for creating smaller 
groups for all in elementary schools. In contrast, 
traditional secondary schools, with their short peri- 
ods, multiple classes, large teaching loads, and 
greater amounts of non-teaching time offer more 
ways to reconfigure their resources. 

Elementary Schools 

Figure 3 presents the resource 
allocation measures for the three 
elementary schools. In the three 
urban districts studied, the tradi- 
tional schools served regular edu- 
cation students in age-graded, 
self-contained classrooms. About 
75 percent of the teachers worked 
with regular education students; 
the other 25 percent worked with 
Title 1 and special education stu- 
dents outside the regular classroom 
(figure 3, line e). Because all of these 
schools are in urban areas with high concentrations 
of students in poverty, even the traditional schools 
were using at least some of their Title 1 resources to 
add regular classroom teachers. Thus, their regu- 
lar education class sizes averaged between 19 and 
22 (line b). Class composition and class size stayed 
the same all day, for all subjects, except when stu- 
dents were pulled out for special education or Title 
1 instruction. The elementary classroom teacher 
instructed all subjects except specialties like art, 
music, and gym which were taught by specialists 
during the classroom teacher's free period. Teach- 
ers had 45 minutes 3 to 5 times a week free from 
instruction in addition to short lunch periods. These 
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times were not coordinated with other teachers in 
any systematic way. 

Reduction of Specialized Programs 

In departing from this organization, all of the 
sample schools increased to above 90 percent the 
percentage of teachers who worked with heteroge- 
neous groups of students (figure 3). The compa- 
rable percentages in the traditional schools ranged 
from 28 to 77 percent. The only teachers in the re- 
structured schools not working with heterogeneous 
groups of students were teachers of special educa- 
tion students in separate classrooms at Quebec 
Heights. 

Each elementary school used different levers for 
matching instructional resources to student needs, 
depending on its educational goals. First, Quebec 
Heights decided to use multi-age grouping to re- 
spond more effectively to diversity 
in student skill levels. Students 
were assigned to multi-age clusters, 
called "families," containing three 
to four teachers and 75 to 85 stu- 
dents in grades 1-3 or 4-6; the fami- 
lies remain together for 3 years. Stu- 
dents may work with any instructor 
within the family during the day 
but each has a homeroom teacher 
who has primary responsibility for 
a class of 22 students for the full 
year. Rather than divide the cur- 
riculum by age level, all students in 
the family study the same basic top- 
ics during the year, but at their own 
developmental level. 

Second, Quebec Heights eliminated separate 
Title 1 programs and used these resources to reduce 
the size of reading groups for all students. Third, 
special education students and teachers were fully 
integrated into the families. In the primary grades, 
the special education resource teacher works as 1 of 
4 teachers in a team responsible for a group of 85 
students. 
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Because all of these 
schools are in urban 
areas with high concen- 
trations of students in 
poverty, even the 
traditional schools were 
using at least some of 
their Title 1 resources 
to add regular class- 
room teachers. 
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Figure 3. — High performing versus traditional elementary schools 
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SOURCE: Unpublished tabulations. 



The Douglass Elementary School in Memphis 
used its Title 1 budget as the primary lever for re- 
thinking resources to improve student performance. 
Because 97 percent of its students qualify for Title 1 
assistance, Douglass has long been free to use Title 
1 dollars across the school. At approximately 
$250,000 dollars per year, these resources represent 
nearly 20 percent of the school budget. Douglass 
restructured resources using an existing model for 
improving student performance, the "Success for 
All" program. Following this model, Douglass uses 
Title 1 dollars to hire reading teachers to work as 
one-on-one tutors with students who do not meet 
reading standards in the first and second grades. 
These teachers and all special education teachers 
combine with regular classroom teachers to reduce 
the size of instructional groups from 24 to about 17 
for 90 minutes of reading a day for all students. 

The Douglass example illustrates why simple 
measures of class size do not provide enough infor- 
mation about how resources follow instructional 



goals. Prior to implementing "Success for All," 
Douglass used Title 1 dollars for regular classroom 
teachers and had classes averaging 17 across the 
school. As Principal Myra Whitney commented, 
"We had slowly reduced all class sizes over the years 
with no plan for how anything in the classroom 
would change. It wasn't working, our students were 
still at the bottom in reading." To implement Suc- 
cess for All, Douglass raised class sizes for all other 
subjects in order to reduce group sizes for reading 
and provide targeted one-on-one tutoring assistance 
to ensure all students are reading by third grade. 
Douglass also redirected resources from grades 3-6 
to the early grades. The decision to take resources 
away from some students and teachers to focus on 
others can produce tension. Douglass's use of a 
proven model that included clear staffing require- 
ments minimized this friction. As one teacher put 
it, "Everything is specified by 'Success for All;' we 
didn't consider quarreling with it because research 
shows this works." 
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Douglass used "Success for All" as a catalyst for 
including its special education teachers and students 
in regular classrooms. By the third year of the pro- 
gram, all students and teachers from previously 
self-contained classrooms and resource rooms spent 
most of their time in heterogeneous groups. Dur- 
ing the daily 90 minutes of reading time, special 
needs students worked in heterogeneous groups 
based on their reading skill levels. The integration 
of special education students was made easier by 
the fact that cooperative learning plays a large role 
in "Success for All" classrooms. Assigning special 
education teachers their own reading groups that 
included students from all programs further re- 
duced the size of reading groups for all students. 
During most of the rest of the day, special educa- 
tion teachers team taught with regular education 
teachers. They used approximately one-quarter of 
their time for performing individual assessments 
and working with students needing 
more targeted help outside the 
regular classroom. 

While Quebec Heights rede- 
signed traditional age-grading prac- 
tices and Douglass rethought its use 
of Title 1 resources, the Mary C. 

Lyons school used the reallocation 
of special education dollars as a re- 
design lever. By including special 
education students, who were pre- 
viously educated in a private setting 
at a cost of over $30,000 each per 
year, with regular education stu- 
dents, Mary C. Lyons created a 
unique, individualized environ- 
ment for students and teachers. Mary C. Lyons is 
open to all students from 7:15 a.m. to 5:00 p.m. daily. 
Each classroom had 15 or fewer students — 10 "regu- 
lar" education students and 5 students with severe 
emotional /behavioral issues — and was staffed by 
a teacher, a teacher intern, and an after-school 
teacher. The pairs of six classroom teachers and six 
teaching interns included three teachers with regu- 
lar education certification and three with special 
education certification. This unusual integration of 
special education students and teachers is driven 
not by finances but by a belief that schools must meet 
children's academic and emotional needs at their 
level of development. The teaching staff is hired to 
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ensure attitudes, skills, and expertise to meet a broad 
range of academic, social and behavioral consider- 
ations. They work closely as a team to analyze the 
effectiveness of their instructional efforts. Academic 
teachers have close to two hours daily of common 
planning time. 

Virtually all teaching resources at Mary C. Lyons 
supported this design, including Title 1 funds and 
funds which would have paid for subject special- 
ists in traditional schools. A typical Boston elemen- 
tary school has four subject specialists (in art, mu- 
sic, physical education and computers) who supple- 
ment instruction and cover planning time for class- 
room teachers. Instead of this costly arrangement, 
Mary C. Lyons pooled these dollars to contract with 
outside teachers for the provision of art and music 
and part of the after school program. 

In summary, each elementary 
school used its resources from spe- 
cial programs to support its core 
design. Quebec Heights and 
Douglass raised regular education 
class sizes and redirected funds to 
reduce reading group sizes. Mary 
C. Lyons used funds freed from 
eliminating separate programs to 
lower teacher student ratios dra- 
matically all day. In each case, staff 
organization depended on the edu- 
cational strategies and approach the 
school had adopted. The organiza- 
tion of resources and the educa- 
tional goals in these schools were 
inextricably intertwined because 
the organization enabled the schools to implement 
new teaching strategies. 

More Flexible Student Grouping 

Perhaps the most striking difference between the 
sample elementary schools and traditional schools 
is the proactive, strategic way in which teachers 
adapted instructional grouping to student needs. In 
a traditional school, administrators assign students 
to programs and classrooms according to bureau- 
cratic rules and categories that stay constant over 
time and subject. Teachers in sample schools used 
their knowledge of student needs, rather than a 
student's program classification or age, to assign 

V6 45 



(E)ach elementary 
school used its re- 
sources from special 
programs to support its 
core design (and) . . . 
staff organization 
depended on the 
educational strategies 
and approach the 
school had adopted. 




Developments in School Finance , 1997 



each to a regular homeroom class and to manage 
their instruction throughout the day. 

In sample schools, reading groups were signifi- 
cantly smaller than in traditional schools. Mary C. 
Lyons and Quebec Heights organized staff to allow 
groups of six and seven, respectively Mary C. Lyons 
used the classroom teacher and teaching intern to 
create reading groups of six. Quebec Heights ro- 
tated Title 1 teachers and instructional assistants 
through regular classroom, so that each classroom 
had three instructors for 90 minutes of reading time 
per day. The primary classroom teacher determined 
the composition of the groups and content of les- 
sons daily based on consultation with the expert 
reading teachers and review of students' progress 
in specific areas. Some lessons divided students into 
groups based on areas where they need further skill 
development, others grouped students heteroge- 
neously to discuss reading content. 

This concentration of resources on 
the reading rotation meant that 
homeroom class sizes were one stu- 
dent larger on average than the tradi- 
tional model. 

At Douglass, all students spent 
90 minutes per day in reading 
groups of 15-17, as compared with 
22 in traditional schools. The com- 
position of these reading groups var- 
ied each day and over the course of 
the year depending on the teachers' 
assessment of student needs. A team 
including the teachers, reading spe- 
cialists, and the "Success for All" fa- 
cilitator assigned students to skill-based reading 
groups across grades using formal assessments ev- 
ery 6 weeks. Since assignment to groups indicated 
skill level, as opposed to age or a static assignment 
of aptitude, the student moved on once he/ she dem- 
onstrated these skills. Students not mastering skills 
by agreed upon times received one-on-one tutoring 
for 20 minutes each day from one of the three read- 
ing specialists. About 15 percent of first and second 
grade students received tutoring at any one time, 
but the students receiving tutoring varied over the 
year depending on who needed extra assistance in 
particular skill areas. 



This continuous assessment and regrouping of 
students required significant time and joint effort. 
A full time "Instructional Facilitator" helped teach- 
ers to conduct, analyze, and act on the assessments. 
The facilitator received in-depth training for using 
"Success For All" reading assessment tools and 
worked with a district wide expert in "Success For 
All" who had further expertise. In pulling this fa- 
cilitator from the classroom, Douglass once again 
traded general regular education class sizes for a 
strategic use of resources which supported their 
school design. In this case, the facilitator enabled a 
more careful matching of instruction to student 
needs as well as more effective use of joint planning 
time. 

Traditional schools experience variations in class 
sizes driven by formulas and enrollment swings. 
Boston's school choice plan allowed Mary C. Lyons 
to cap the number of students by 
grade through the student assign- 
ment process. Mixed age group- 
ing at Douglass and Quebec 
Heights allowed teachers to con- 
trol group sizes. For example, if 
Douglass had used age-based 
grading, class sizes in the first and 
second grade would have been 24 
and 26, respectively, with class 
sizes declining as the student 
moved toward sixth grade. In- 
stead, the Douglass staff combined 
grades to create smaller groups of 
23 in the first three grades and 26 
in the intermediate grades. Thus, 
sample schools exerted more con- 
trol over class sizes by combining ages and programs 
so that all students were in targeted class sizes rather 
than the 60 to 65 percent who would have been in 
such group sizes using traditional age grading. 

Structures to support more personal 
relationships 

Quebec Heights' family structure aimed to 
strengthen relationships between teachers and stu- 
dents by keeping teachers with the same family of 
85 students for 3 years, usually with the same 
homeroom class. This meant that some teachers re- 
ceived as few as nine new students each year. As an 



Perhaps the most 
striking difference 
between the sample 
elementary schools 
and traditional schools 
is the proactive, strate- 
gic way in which 
teachers adapted 
instructional grouping 
to student needs. 




Rethinking the Allocation of Teaching Resources 



intermediate teacher stated, "It's hard to overesti- 
mate how much time this saves us. We get started 
quickly in the new school year, students know the 
rules and boundaries, and I know what they can 
do." 

The Mary C. Lyons school's small size and in- 
tense staffing ratios created a highly personal envi- 
ronment for all students. The staff also created time 
to discuss each student's progress as a team. All the 
professionals working with each group of stu- 
dents — the classroom teacher, intern, a special edu- 
cation evaluation specialist, the afterschool direc- 
tor, and a social worker — met weekly for 45 min- 
utes. Together, they identified problems, discussed 
possible strategies, and shared successes and frustra- 
tions. 

More Common Planning Time 

All three schools created more 
common planning time, but the con- 
straints of time and collective bar- 
gaining agreements meant that only 
Mary C. Lyons made dramatic 
changes (figure 3). Both Douglass 
and Quebec Heights increased com- 
mon planning time using the con- 
ventional method of scheduling spe- 
cialists to allow meeting time for 
small groups of teachers. Mary C. 

Lyons' academic teachers shared a 
30 minute lunch period followed by 
1 hour and 15 minutes of common 
planning time while students had 
lunch and recess and received in- 
struction from their instructional intern and 
after-school teacher. In combination with the "stu- 
dent support" team meetings described above, 
teachers met together for a total of 405 minutes 
weekly, in contrast to no more than 60 minutes in a 
traditional school. 

Creative Definition of Staff Roles and Work Day 

The Mary C. Lyons school was able to create 
so much more planning time because it redefined 
teaching roles throughout the day. Whereas in a tra- 
ditional school only the classroom teacher or sub- 
ject specialists assume responsibility for classroom 
instruction, Mary C. Lyons had a master teacher and 



a highly trained and supervised "instructional as- 
sistant trainee" in each classroom. In contrast to 
often poorly trained paraprofessionals, the Mary C. 
Lyons "trainees" were college educated students 
working on their master's degrees in special educa- 
tion at Wheelock University. To do this, Mary C. 
Lyons negotiated with the Boston Teachers Union 
to convert paraprofessional slots to "instructional 
assistant trainee" positions. As part of their pro- 
gram, the Wheelock students work in schools for 
stipends of $10,000 per year and participate in in- 
tensive coursework over holidays and summer. 
Wheelock sends a faculty member every two weeks 
to observe and discuss the trainee's practice with 
the master teacher. The trainee's stipend costs less 
than the $18,000 in salary and benefits for a para- 
professional. The savings allowed the school to give 
each teacher an "instructional assistant trainee." 
Where possible, the new instructional assistants 
were recruited from the existing 
paraprofessional staff. While the 
trainee position represented a 
short term cut in pay, this position 
led to full-fledged certification as 
a special education teacher for 
these staff. 

In addition, Mary C. Lyons 
contracted with teachers to cover 
"schoolwide" planning time. The 
"afterschool" teachers overlapped 
the regular school day by one hour. 
During this time, they managed 
the classroom along with the 
instructional assistant trainee. In 
addition to providing regular 
teachers with planning time, this overlap provided 
a chance for after school teachers to transition from 
the regular academic day along with someone who 
had been with the students all day, thus allowing 
more continuity and better care for the children. The 
eight afterschool teachers, who were provided 
through a contract with a private nonprofit organi- 
zation, specialized in behavior management and 
brought a wide range of experience with emotion- 
ally disturbed as well as gifted students. The prin- 
cipal worked closely with the contractor to specify 
the qualifications of these teachers, and the contract 
was contingent on the hiring of such exceptional 
teachers. 



...Lyons school’s 
small size and in- 
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— personat environment 
for all students. The 
staff also created 
time to discuss each 
student’s progress as 
a team. 
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Secondary Schools 

The traditional high school, with its departmen- 
talized instruction and highly-segmented school 
day, offers many more opportunities for rethinking 
resource allocations than do elementary schools. We 
examined a typical comprehensive high school in 
New York City serving about 3,300 students. 3 The 
school is considered a "good" school serving a 
largely middle- and working-class population of 
students. It had nearly as many special needs and 
Title 1 students as CPESS and was in the process of 
beginning to restructure its programs. At the time 
of the study, however, it used traditional staffing and 
scheduling practices. 

As figure 4 shows, the sample high schools 
looked very different from the traditional high 
school on virtually every dimension measured. Al- 
though our analysis is focused on 
the use of instructional staff, it is 
worth noting that the traditional 
high school had many more 
non-teaching staff than the two re- 
structured schools. Not including 
custodial and food service workers, 
more than 40 percent of its total staff 
had non-teaching assignments. 

These included 1 principal, 9 assis- 
tant principals, 13 secretaries, 10 
school-based services specialists 
(social workers, psychologists, etc.), 

3 librarians, 17 security guards, 22 
non-teaching school aides, and 14 
classroom-based paraprofessionals. 

In the restructured schools, just over 
25 percent of staff had non-teaching assignments 
and most of them taught at least part-time 
(Darling-Hammond, 1997). 

The traditional high school had one instructional 
staff person for every 14.7 students — and New York 
City staffing allocations would reduce the student 
load to 13 for a population of students like that 



at International High. 4 Because fewer than two- 
thirds of these instructional staff members taught 
full-time, however, class sizes averaged about 33. 
Special education, bilingual education, English as a 
Second Language, and Title 1 programs were ad- 
ministered separately, with generally smaller class 
sizes. The typical student attended school from 8:05 
a.m. to 2:13 p.m. participating in seven different 42- 
minute classes with seven different teachers, plus 
one lunch period. Teachers taught five instructional 
periods a day, with two periods free from in- 
struction: one used for planning and the other for 
rotating "building assignments" such as cafeteria 
duty or hall duty or other administrative or pro- 
gram responsibilities. Excluding these special du- 
ties, teachers routinely saw about 167 students per 
day. 

By contrast, the two sample high schools began 
with resources roughly similar to 
the traditional school and ended 
with dramatically smaller group 
sizes and teacher loads. As noted, 
the first difference was allocating a 
greater share of resources to instruc- 
tional staff rather than administra- 
tive and support staff. The second 
was assigning almost all instruc- 
tional staff to work directly with 
students. As a consequence, all stu- 
dents experienced much smaller 
class sizes (18 at CPESS and 25 at 
International High), while their 
teachers also had much more plan- 
ning and professional development 
time. Because teachers taught a 
smaller number of longer periods, pupil loads were 
also reduced: teachers at CPESS saw 36 students and 
those at International High saw 75 students in a 
given term. The schools achieved this by reducing 
specialization, reorganizing student groups and 
teaching structures, and investing heavily in pro- 
fessional development. The strategies the two 



The traditional high 
school, with its depart- 
mentalized instruction 
and highly-segmented 
school day, offers many 
more opportunities for 
rethinking resource 
allocations than do 
elementary schools. 



3 In a related study (Darling-Hammond, 1997), Darling-Hammond and colleagues also examined a smaller, more affluent 
suburban high school of about 1,600 students. However, the large difference in student populations between this school and 
the redesigned schools in New York City made this school inappropriate for the current analysis. 

4 Because International High has a unique student population comprised of 90 percent Title 1 eligible and 100 percent limited 
English proficient students, an analogous traditional school could not be found for comparison. Instead, we used the New 
York City staffing guidelines, as outlined in the New York City publication Comparative Analysis of the Organization of High 
Schools , 1992-93, to estimate staffing for students identified for special needs programs. 
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Figure 4.— High performing versus traditional secondary school 



Resource allocation 
principles 


Central Park 

School measure East Elementary 


International 

High 


Traditional 


Reduction of specialized 
programs to provide more 


a. Students per instructional 
staff member 


10.2 


10.2 


14.7/13* 


individual time for all in 


b. Students per full-time teacher 


13.3 


15.8 


23.6 


heterogeneous groups 


c. Average size of regular 
instructional group 


18 


25 


33.4 




d. Percent teachers in regular 
instructional groups 


89% 


100% 


70% 


More flexible student grouping 


e. Percent students in target size grouping 


100% 


100% 


60% 


by school professionals 


f. Average size of advisory group 


15 


12 


29 


Structures to create more 


g. Student loads per term 


36 


75 


(home 

room) 

167 


personalized environments 


h. Percent professional staff serving 
as instructors/advisors 


100% 


100% 


65% 


Longer and more varied 
blocks of instructional time 


i. Average length of instructional 
period (in minutes) 


120 


70 


42 


More common planning time 


j. Common planning minutes/week 


450 


350 


0 


for staff 


k. Length of longest planning period 
(in minutes) 


120 


140 


42 



* A traditional high school that had a 100 percent Limited English Proficient student population like that at International would 
receive additional staff to reduce its student/teacher ratio for those students to 13:1 . 



SOURCE: Unpublished tabulations. 



schools used reflected their instructional purposes 
and philosophies. 

Reduced Specialization 

CPESS reduced specialization in a host of ways 
in order to create smaller teacher-student loads and 
focus resources on academic subjects. CPESS fol- 
lows the principles embraced by the Coalition of 
Essential Schools, one of which is that "less is more." 
Instead of aiming for broad coverage of content, 
CPESS has organized its curriculum around five 
"Habits of Mind" which encompass the abilities to 
weigh evidence, take varying viewpoints into ac- 
count, see connections and relationships, speculate 
about possibilities, and assess value. These goals 
are reinforced in every course and in a comprehen- 
sive portfolio assessment system. The school con- 
centrates its resources on a common core curricu- 
’um in grades 7-10, and uses college and commu- 
/ / 



nity resources to expand curriculum options in the 
upper grades. 

At the time of the study, all students took aca- 
demic subjects in heterogeneous groups of about 18. 
Students in divisions I and II (grades 7-10) took two, 
two-hour academic courses each day, humanities 
and math/science plus a foreign language offered 
in a shorter period before school. All full-time teach- 
ers in these grades, with the exception of two spe- 
cial education resource room teachers, taught one 
of the two interdisciplinary courses. The resource 
room teachers helped students with their regular 
classroom work, thereby reinforcing rather than 
fragmenting students' learning. Students and teach- 
ers told us that the extraordinary outcomes achieved 
at CPESS were made possible by these small class 
sizes and the continuity provided by (a) having a 
team of teachers stay with the same students for two 
S 9 
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years (b) providing extra help to special needs stu- 
dents connected to their regular classroom work. In 
addition, teachers were grateful for opportunities 
to extend their disciplinary knowledge by planning 
together in subject area teams and to extend their 
knowledge of students by also working together in 
"house" teams organized around the teaching of 
shared groups of students (Darling-Hammond, 
Ancess, and Falk, 1994). 

In the Senior Institute (grades 11-12), the school 
reduced its need for specialization by working out 
advanced course-taking opportunities for students 
at local colleges to supplement required courses and 
electives offered in the school. All students took at 
least two college courses during their last two years 
of high school, along with an internship in a local 
business or community organization. These strate- 
gies expanded students' academic and vocational 
opportunities while freeing staff 
time for the one-on-one advisement 
needed to support students in com- 
pleting extensive research projects 
and the other portfolio entries they 
would defend before their gradua- 
tion committees. 



The existence of 



clear school goals 
and a consensus 
about strategies 
enhanced Interna- 



tional High’s ability to 
design a coherent, 
carefully configured 
organization. 



Language instruction and some 
electives were provided through 
outside contracts. Although stu- 
dents could opt to take more ad- 
vanced courses in specific subjects, 
there was no tracking, no separate 
Title 1 programs, and no separate 
bilingual program. Instead of hir- 
ing guidance counselors, each 
teacher was responsible for counseling 12 students 
during scheduled advisory periods. The small size, 
personalization, and team organization also elimi- 
nated the need for attendance officers, deans of dis- 
cipline, assistant principals, and supervisors, roles 
that deflect resources away from teaching positions 
in traditional high schools. 

The organization of resources at International 
High also followed its educational mission — the 
education of recent immigrants — and its philosophy, 
which includes the following principles: 



• Language skills are most effectively learned 
in context and when embedded in a content 
area. 

• Successful educational programs emphasize 
rigorous standards coupled with effective 
support systems. 

• Attempts to group students homogeneously 
preclude the way in which adolescents learn 
best (i.e., from each other). 

• The carefully planned use of multiple learn- 
ing contexts in addition to the classroom (e.g., 
learning centers, career internship sites, field 
trips) facilitates language acquisition and con- 
tent area mastery. 

The existence of clear school goals and a con- 
sensus about strategies enhanced International 
High's ability to design a coherent, carefully con- 
figured organization. The school re- 
organized its programmatic re- 
sources around 12 interdisciplinary 
themes. Six self-managing instruc- 
tional teams, called "clusters," were 
composed of four to six teachers 
plus guidance and para professional 
staff who developed two themati- 
cally based courses of study (e.g.. 
Motion, Visibility), which inte- 
grated four subject areas (e.g., litera- 
ture, global studies, mathematics, 
and physics). The team took re- 
sponsibility for the total educational 
experience of about 75 students 
during a 13-week course of study. 
Students chose three of these the- 
matic courses each year. 



All teachers taught heterogeneous groups of stu- 
dents that included all native languages and all 
grades, economic levels and ability levels. The fac- 
ulty integrated English as a Second Language (ESL) 
techniques into their content-area courses while pro- 
viding students with opportunities to further de- 
velop their language skills with instructors outside 
the core curriculum and in learning contexts such 
as internships outside the school. The success of 
this strategy is illustrated by the fact that virtually 
all International High students pass the New York 



ERJC. 
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State competency tests in English as well as other 
subjects, and virtually all are accepted to college 
prior to graduation. Teachers designed the cluster 
schedule because they felt that planning and teach- 
ing in interdisciplinary teams would enable them 
to have more influence on student learning. They 
found that the school's first cluster experiment pro- 
duced much greater levels of student success than 
had independent classes, because teachers' efforts 
were jointly planned and cumulative, and teachers 
could deal with students' needs and problems in a 
concerted fashion (Darling-Hammond, Ancess, and 
Falk, 1995). 

In both schools, the integration of previously 
specialized resources, along with the investment of 
greater resources into teaching positions rather than 
nonteaching positions, translated into much lower 
pupil loads and more opportunity for individual 
student attention than in the traditional school. In 
addition to focusing resources on instructional po- 
sitions, the sample schools used most of their teach- 
ing resources in one core academic program in 
which all students participated, rather than using 
special program resources for add-on remedial or 
special education programs. CPESS used 89 per- 
cent of teaching resources in its core instructional 
program, while International High used all staff in 
the core program, as compared with roughly 70 per- 
cent of teachers working in regular instruction in 
the traditional high school. 

Smaller class sizes were also achieved by creat- 
ing a broader role for professional staff in the re- 
structured schools, rather than using specialists to 



perform "non-classroom" functions. Staff acknowl- 
edged this tradeoff in a set of "understandings that 
underlie professional staff work at CPESS" which 
includes the following statement: 

In return for smaller class sizes (maximum 
20) and smaller total student rolls, teachers 
will work with students for a total of 22 hours 
per week in classes, advisories, or tutorials; 
conducting seminars; overseeing projects; giv- 
ing lectures; or advising and coaching indi- 
vidual students (CPESS, 1991). 

More Flexible Student Grouping 

Reducing the number of programs, courses, and 
levels made it easier for the sample schools to con- 
trol the size of instructional groups. As table 1 
shows, although 64 percent of all classes in the tra- 
ditional high school had 29 to 34 students, 21 per- 
cent of classes were smaller than 25. Class sizes were 
higher in regular education academic classes than 
in nonacademic classes. 5 In contrast, CPESS and In- 
ternational High place all of their students in target 
size groups. 

Even more flexible grouping strategies were 
found in CPESS's Senior Institute (grades 11-12), 
where teachers and students focused substantial at- 
tention on preparing the graduation portfolio and 
applying to colleges. Time was allocated to allow 
teachers to provide coaching and support for inde- 
pendent study. A typical teacher would teach two 
classes over about 12 hours per week. He or she 
would also spend 4 to 5 hours per week supervis- 
ing independent projects, another 4 to 5 hours in 



Table 1.— Class sizes in the traditional high school 


Class size 


Academic 


All classes 


0-19 


6 percent 


8 percent 


20-24 


7 percent 


13 percent 


25-28 


. 1 3 percent 


13 percent 


29-34 


72 percent 


64 percent 


Over 34 


3 percent 


3 percent 


SOURCE: Unpublished tabulations. 



5 New York City Schools, Comparative Analysis of the Organization of High Schools, 1992-93, p. 82-92. 
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advisory working with his or her 12 advisees on aca- 
demic and personal concerns, and another 3 1/2 
hours per week for one-on-one help to students. 
Class periods varied in length depending on their 
purpose. While students took courses and com- 
pleted internships outside the school, teachers got 
time to work and plan together. 

Structures to Create Personal Relationships 

The sample schools were organized more like 
elementary schools than most secondary schools, 
featuring small teaching units and closer, more sus- 
tained relationships between teacher and student. 
In addition to the smaller class sizes, both sample 
schools used "advisory groups" as a key strategy 
for maintaining ongoing relationships with students. 
Each professional staff member worked with a 
group of 12 to 15 students and their families. The 
use of all professional staff, not just teachers, allowed 
advisory groups to be smaller than 
average class sizes. Advisory 
groups met for approximately 4 
hours a week at CPESS, and for 
about 2 hours weekly at Interna- 
tional High. Teachers and advisors 
used the time for individual study; 
to discuss health, social and ethical 
issues; and for individual and group 
advising and counseling. The ad- 
visor served as the "expert" on the 
student, meeting regularly with the 
family and other teachers to discuss 
the student's needs and progress, 
coordinating parent conferences 
and the preparation of narrative as- 
sessments of student work. 

Whereas all professionals in the two restructured 
schools worked on a regularly scheduled basis with 
groups of students, only 65 percent of the pro- 
fessional staff at the traditional high school had regu- 
larly scheduled contact with a continuing group of 
students. While guidance counselors and other sup- 
port personnel worked intensively with some stu- 
dents, they did so on a reactive, usually sporadic 
basis that was not designed to create close, long term 
relationships. 



Longer and More Varied Blocks of Instructional 
Time 

In contrast to the traditional high school's seven 
42-minute periods each day, both restructured high 
schools created longer periods and more flexible 
schedules to accommodate more ambitious kinds 
of work and to allow more time for teachers' to sup- 
port student learning. At CPESS, students in grades 
7-10 had two 2-hour blocks of humanities and 
math/ science each day. Since these two teachers 
worked together as a team, they could vary the split 
of time between the two to accommodate daily les- 
son plans. In addition, one morning per week stu- 
dents spent 2 1/2 hours in a community service 
project while their teachers were engaged in cur- 
riculum planning. Other coursework, such as lan- 
guage instruction, took place in smaller (usually 1 
hour) blocks of time. In the Senior Institute, classes 
varied in length from 1 to 2 hours, while advise- 
ment sessions, internships, and in- 
dependent work time were sched- 
uled for longer blocks of time to 
allow students to undertake ex- 
tended work with adequate coach- 
ing and time for research. 

At International High, students 
typically had four courses each of 
which met for 70 minutes four 
times per week. They also had a 2 
hour internship and an hour long 
seminar each week. Because each 
cluster of four teachers controlled 
their shared students' entire time 
schedule during a 13- week cycle, 
they could vary time across classes 
each day as needed for the work in which students 
were engaged. 

More Common Planning Time 

Both sample high schools created structures that 
demand and allow much more common planning 
time. In addition to individual "prep" time, CPESS 
teachers spent on average 7.5 hours per week in 
scheduled common planning time. CPESS used four 
strategies to create this time. First, teachers met with 
their disciplinary teams for 2 1/2 hours of weekly 



. . . both restructured 
high schools created 
longer periods and 
more flexible sched- 
ules to accommodate 
more ambitious kinds 
of work and to allow 
more time for teach- 
ers’ to support student 
learning. 
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curriculum planning while students were in com- 
munity service placements. Second, while teach- 
ing fellows and other professionals provided cov- 
erage, teachers had from 1 1 /2 to 3 hours each week 
to meet with fellow "house" teachers and with stu- 
dents individually. Third, students' hours were in- 
creased during the week so they could be dismissed 
at 1:00 p.m. on Fridays to create time for a weekly 2 
hour staff meeting. Finally, as its governance plan 
states, "the full staff agrees to meet during hours 
when the students are not in attendance to com- 
plete necessary business." In addition to the Fri- 
day meeting, teachers attended a regular Monday 
meeting from 3:00 p.m. to 4:30 p.m. 

At International High, teachers had 140 min- 
utes each week to plan with their cluster while stu- 
dents participated in college courses and other ac- 
tivities. During a half day each week set aside for 
club activities for students, teach- 
ers staff-initiated professional de- 
velopment. In addition, teachers 
had a 70 minute individual plan- 
ning period each day which often 
coincided with that of other mem- 
bers of their team. These models 
offer stark contrast to the traditional 
high school model in which teach- 
ers had one or two separate 42- 
minute periods free from instruc- 
tion, one often devoted to nonaca- 
demic duties and the other orga- 
nized as an individual preparation 
period. 

Creative Definition of Staff Roles 
and Work Day 

Both of the sample schools have made many 
changes to the roles of teachers and the typical or- 
ganization of the teacher work day in ways that 
enable greater personalization for students. They 
focused teaching resources on core academic sub- 
jects by contracting with outside providers for elec- 
tives and non-academic subjects. CPESS increased 
resources for teachers by incorporating counseling 
and advising into the teaching role. In both schools, 
teachers are involved in curriculum and assessment 
development, hiring and evaluation of staff, 
schoolwide decisionmaking, and staff develop- 

O 




ment. These broader professional roles not only re- 
duce the need for nonteaching specialists assigned 
to manage and oversee teaching, they also enrich 
teachers' knowledge and skill by giving teachers 
continuous opportunities to reflect on their teach- 
ing and learn from one another, thus expanding the 
expertise available in the school (Darling- 
Hammond, 1997). 

Policies, Regulations, and Contract- 
ual Issues 

To accomplish these things, the sample schools 
directly challenged policies, regulations, and collec- 
tive bargaining agreements. First, most of the 
schools changed the contractually defined teacher 
work day and contractual rules for such matters as 
seniority transfers. Second, in breaking down bar- 
riers between programs, age groupings, and sub- 
jects, they confronted staffing for- 
mulas, program administration 
rules, and, sometimes, teacher li- 
censing categories. Third, many of 
these schools redefined both teach- 
ing and non-teaching positions to 
create new jobs which do not fit 
neatly into existing contractually 
defined categories. 

Collective bargaining agree- 
ments in most districts clearly de- 
fine the teacher work day, outlining 
the hours teachers are required to 
work and limiting the number of 
required afternoon and evening 
meetings. Most go further to 
specify the number of minutes of time teachers must 
have free for lunch and planning activities, and some 
limit the number of hours in a row teachers can be 
involved in instruction, making it difficult to create 
connected blocks of planning time. It is easy to un- 
derstand the reasons for these provisions, but it is 
also clear that new strategies are needed for schools 
in which teachers jointly develop curriculum and 
manage their own and students' time. 

As teaching jobs are broadened, schools can run 
into state, district, and collective bargaining restric- 
tions. Using teachers across subjects or programs 



Both of the sample 
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teaching resources on 
core academic sub- 
jects by contracting 
with outside providers 
for electives and 
non-academic sub- 
jects. 
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can require waivers. Mary C. Lyons use of three spe- 
cial education teachers and three regular education 
teachers to teach integrated classrooms of special 
needs and regular education students required waiv- 
ers from the Boston teachers contract and Massachu- 
setts state certification laws. The principal argued 
that she knew how to identify individuals with ex- 
perience and disposition to handle both special edu- 
cation and regular education students. She devel- 
oped a plan to create a team structure which took 
advantage of teachers' diverse skills and a profes- 
sional development plan for each teacher and for the 
entire school, so that they would develop the skills 
they needed. 

Schools also can run into certification problems 
in moving to interdisciplinary instruction, because 
many collective bargaining agreements and state 
regulations require teachers to hold certification in 
more than one subject to teach hu- 
manities or math/ science in middle 
or high schools. Finding individu- 
als with the subject and pedagogi- 
cal knowledge to combine these 
subjects effectively is obviously 
critical to successful interdiscipli- 
nary instruction. Although certifi- 
cation in both fields is one indica- 
tor of this ability, it is not the only 
means for developing expertise in 
a second field. At CPESS, teachers 
with a background in one field plan 
in curriculum teams (a math /sci- 
ence team and a humanities team) 
that provide the additional disci- 
plinary expertise they need to 
handle the breadth the core courses require. 

New job positions and hiring arrangements also 
confront some collective bargaining agreements and 
traditional allocation guidelines. For example, Mary 
C. Lyons and CPESS created a different kind of In- 
structional Assistant by using teaching interns— 
graduate students who are preparing to become 
teachers — instead of paraprofessionals or untrained 
support staff. In addition, three of the sample schools 
received waivers from collective bargaining agree- 
ments to use outside contractors for specific kinds 
of instruction. 



Selection and retention of teachers with the re- 
quired qualities and experience to match these 
school designs is critical to their success, yet many 
districts treat staff as interchangeable when they 
make assignments and move staff on seniority trans- 
fers. Some districts have solved this problem by 
creating alternative personnel tracks for specially 
designated 1 schools. Cincinnati has done this for 
Paidea and Montessori schools. In Boston, schools 
negotiate control over the hiring process on a posi- 
tion by position basis. In New York, recent contract 
negotiations have allowed for teams of teachers, in- 
cluding principals and union representatives, to se- 
lect their new colleagues in the growing number of 
schools that have a distinctive missions. With the 
recent creation of over 100 new small schools in the 
city joining the substantial number of longer stand- 
ing alternative schools, this provision paves the way 
for widespread use of new staffing models. 



Finding individuals with 
the subject and peda- 
gogical knowledge to 
combine these subjects 
effectively is obviously 
critical to successful 
interdisciplinary instruc- 
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Finally, teacher contracts, dis- 
trict policies and state regulations 
often define class size maximums 
by program, grade level, and some- 
times subject. State guidelines 
specify the size of classroom for stu- 
dents at each level of special edu- 
cation classification. Schools can 
depart from these regulations if par- 
ents, teachers and special education 
professionals agree to an individual 
education plan that educates the 
student in a larger, more inclusive 
setting. Designs like that used at the 
Mary C. Lyons school require in- 
tense communication with students 
and parents to create understanding of the new ap- 
proach and strategies to insure appropriate addi- 
tional support for the students. They also demand 
that state and district officials work with schools to 
allow educationally sound designs. 

District student and teacher assignment policies 
can frustrate attempts to use teachers differently. In 
the sample districts, schools moving students from 
more restrictive special education settings into the 
regular classroom sometimes faced a potential loss 
of teachers because special education staff were al- 
located based upon the number of students requir- 
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ing separate education. If schools integrate students 
back into the regular classroom and resources are 
therefore reduced, the regular teacher in whose class 
the special education student now spends most of 
his time receiyes no extra resources and no reduced 
student load. In these cases, schools can find that 
regular education classrooms grow more unruly and 
crowded while the case loads of special education 
teachers decline. Over time, schools should find 
ways of shifting resources back into the classroom 
without losing special education expertise. To re- 
spond to this problem, Boston has adjusted its staff- 
ing formula to allow schools to use the resources 
for special needs students in inclusive settings. 

Conclusion 

Although these five high performing schools 
look very different from one another, they have all 
redesigned the way they allocate 
teaching resources to meet student 
needs and to create the time teach- 
ers need to implement a new vision 
of schooling. They demonstrate 
how schools can reallocate resources 
to implement new designs. The 
framework presented here aims to 
provide researchers and prac- 
titioners with a way to examine sys- 
tematically the possibilities for re- 
allocation and to measure their im- 
pact. Changing school organiza- 
tions to better fit an instructional 
vision will require schools to con- 
front long traditions and a host of 
state, district, and union policies 
and practices that conflict with many of the changes 
outlined here. These barriers can loom large. But, 
the biggest constraint may be lack of vision about 
the concrete changes in school organization that can 
create a more professional organization and improve 
student achievement. This paper aims to provide 
some clear, detailed examples schools might use to 
develop such a vision, including goals for student 
achievement along with educational strategies and 
an organization to accomplish these goals. 



The variety of models presented here suggests 
that resource reallocation and the design of an in- 
structional vision and strategy are intertwined. 
There is little rationale for restructuring resources 
without an underlying educational design. At the 
same time, none of these models could have accom- 
plished their goals without making changes in the 
use of resources. As these models and others are 
tested against evidence of improved student perfor- 
mance, one could imagine states and districts work- 
ing with schools to adopt proven designs, through 
a conscious process of changing resource allocations, 
practices, and regulations at each level. As part of 
the process of choosing an appropriate design, 
schools might undertake a comprehensive review 
of how their practices, resources, knowledge and 
skills would need to change to implement a new 
model. Principles of resource allocation and indi- 
cators of their use could form the basis for tools 
which help schools and districts un- 
derstand their progress. Districts 
could then organize their work to 
support these plans and develop 
strategies for helping schools make 
changes. This would include 
changes in state and district poli- 
cies that may produce obstacles to 
alternative forms of organization. 

The schools studied here have 
only touched the surface of oppor- 
tunities for rethinking the way 
school resources are used; they 
have largely worked within exist- 
ing salary structures and have not 
much explored the use of tech- 
nology in the classroom. Nevertheless, they fore- 
shadow the many ways schools may rethink exist- 
ing resources to create more personalized education 
for students and more professional responsibility 
and growth for teachers. 



Changing school organi- 
zations to better fit an 
instructional vision will 
require schools to con- 
front long traditions and 
a host of state, district, 
and union policies and 
practices that conflict 
with many of the 
changes outlined here. 
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Financing Education in the 
District of Columbia from the 
Perspective of the Financial 

Authority 



Introduction 

The District of Columbia Financial Authority 
was created by the U.S. Congress in 1995 to repair 
the District of Columbia's failing financial condition 
and to improve the management effectiveness of 
government agencies. The Authority has identified 
public education, along with public safety and some 
activities in public works, as priority areas. 

In November 1996, the Authority overhauled the 
governance and administration of the District of Co- 
lumbia Public Schools (DCPS). After issuing a scath- 
ing report on the quality and management of the 
District's schools, the Authority removed the super- 
intendent and stripped the Board of Education of 
most of its power to control the schools. In their 
place, the Authority appointed a new Chief Execu- 
tive Officer and an Emergency Board of Trustees to 
run and oversee the schools. 

This article describes the condition of the 
District's public schools that caused the Authority 
to take such a bold action. Next, it outlines the 
unique nature of the financing of the District's pub- 
lic schools, and then briefly discusses higher educa- 
tion in the District and the Authority's efforts to ad- 
dress problems at the University of the District of 
Columbia. 



Dr. Joyce Ladner 

District of Columbia Control Board 



The Failure of the District of 
Columbia Public Schools 

In fall 1996, after extensive study by the 
Authority's staff and consultants and after conduct- 
ing several public hearings, the District of Colum- 
bia issued a report entitled Children in Crisis: A Re- 
port on the Failure of the D.C's Public Schools. The full 
text, as well as other D.C. Financial Authority infor- 
mation and reports are available on the Internet at 
www.dcfra.gov. 

In that report, the Authority concluded that the 
deplorable record of the District's public schools by 
every important educational and management mea- 
sure had left the system in a state of crisis. DCPS 
was simply failing in its mission to educate the chil- 
dren of the District of Columbia. In virtually every 
area, and for every grade level, the system failed to 
provide the District's children with a quality edu- 
cation and safe environment in which to learn. 

Abysmal Education Outcomes 

DCPS fails to teach its pupils even the basics of 
education. Asa result, the system's students score 
significantly lower on standardized academic 
achievement tests than their peers in comparable 
districts around the nation. 
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Figures 1 and 2 benchmark DCPS's performance 
against other school systems and figure 3 measures 
DCPS's performance by ward over time. Figure 1 
shows that the average SAT scores of students tak- 
ing the test in school districts surrounding D.C. and 
peer school districts across the country exceeded 
those of students attending D.C. public schools. 

Figure 2 shows that the performance of D.C. 
public school students on the National Assessment 
of Educational Progress (NAEP) lagged behind both 
the average scores of students at the national level 
and those of students in the northeastern states. In 
1994, only 22 percent of DCPS's fourth-grade stu- 
dents scored at or above the basic 1 level — a decrease 
of 6 percent from 1992. 

The results from Stanford Achievement Tests ad- 
ministered in May, 1997, indicate that there has been 
little improvement since 1994. Thirty-three percent 
of third-graders were below basic levels in both read- 
ing and mathematics. Twenty-nine percent of 
eighth-graders were below basic levels in reading, 
and 72 percent of eighth-graders were below basic 
levels in mathematics. 



Performance varies extensively among sections 
of the District of Columbia (wards); the least afflu- 
ent wards have experienced the greatest decline in 
test scores over the last 5 years. Figure 3 shows that 
while test scores in the more affluent sections of the 
District have remained the same or improved 
slightly, scores in the poorer wards have declined 
dramatically. Several schools in wards seven and 
eight have seen startling declines in test scores of 15 
to 20 percentage points or more. The low achieve- 
ment levels attest to the fact that thousands of chil- 
dren, especially those in the less affluent sections of 
the District, are not being taught the fundamental 
skills necessary to succeed after they leave DCPS. 

Drop Out Rates Are Too High 

As academic performance continues to decline, 
many students are dropping out or leaving DCPS 
for neighboring districts and private schools. Fig- 
ure 4 illustrates the dramatic migration of students 
out of the public school system — 40 percent of high 
school students either dropped out or left the 
District's public schools between 1989 and 1995. 



Figure 1. — Average composite SAT scores: 1995 




DC National Neighboring Fairfax Montgomery Peer Baltimore Boston Cleveland Milwaukee Philadelphia Newark 

average average County County average 



NOTE: Information is not provided for Chicago, New Orleans, and Detroit. The SAT is not widely taken in Chicago and New 
Orleans because the local colleges and universities do not require it. Information on SAT scores in Detroit was unavailable. 

SOURCE: Information reported by individual districts. 
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Basic" is defined as a partial mastery of knowledge and skills that are fundamental for satisfactory work at this grade level. 
Below basic" is defined as little or no mastery of fundamental knowledge and skills for this grade level. 
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Figure 3. — Changes in sixth-grade test scores, by District of Columbia wards: 1991 to 1996 



Percent 
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Ward 

NOTE: Average of reading and mathematics percentiles from the sixth grade. 
SOURCE: Office of Educational Accountability, District of Columbia Public Schools. 
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An increasing number of high school students 
left DCPS between 1993 and 1995. Figure 5 shows 
the annual attrition for the classes of 1993, 1994, and 
1995. On average, 24 percent of students left their 
class in 10th grade, 23 percent in 11th grade, and 6 
percent in 12th grade. Drop-out rates are even 
greater in the transition from elementary to the jun- 
ior high level as parents who can afford it send their 
children to private schools. 

Unsafe Environments Disrupt Learning 

The District's schools, like other urban environ- 
ments, have serious problems in terms of both se- 
curity and deteriorating facilities. 

A National Center for Education Statistics 
(NCES) survey of teachers revealed that more in- 
structors from DCPS than instructors from other 
states believed that a variety of serious problems 
affected their schools. Problems included: lack of 
parent involvement, unpreparedness to learn, tar- 
diness, disrespect for teachers, student absenteeism, 
student apathy, and verbal abuse of teachers (figure 
6 ). 



Compared with the national average, more 
DCPS high school teachers and students report be- 
ing threatened with violence. Figure 7 shows that 
according to the 1995 National Education Goals Re- 
port, 26 percent of DCPS teachers reported that they 
were threatened, injured, or physically attacked by 
a student in the past 12 months, compared with a 
national average of 14 percent. Other state-level vio- 
lence statistics are also bleak. 

The infrastructure of the District's public 
schools is collapsing. The alarming condition of fa- 
cilities leaves students exposed to discomfort and 
even to potential harm — boilers burst, roofs leak, 
firedoors stick, bathrooms crumble, and poor secu- 
rity permits unauthorized individuals to gain ac- 
cess, threatening the safety of students. Such con- 
ditions make it almost impossible to focus on the 
primary mission of educating the children. Figure 
8 compares the condition of the District's schools to 
other schools in the nation. 



Figure 4. — Percentage of students beginning ninth grade who eventually left DCPS or who 
and graduated from DCPS: 1989-95 




SOURCE: District of Columbia Public Schools (DCPS) Dropout and Migration Statistics 1991-95. 
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Figure 5. — Enrollment changes for students in grades 8 through 12 in District of Columbia 
public schools (classes of 1993-95) 



Total number of students 





8th grade 9th grade 10th grade 11th grade 12th grade graduates 



SOURCE: District of Columbia Public Schools Dropout and Migration Statistics (1991-95). 



Figure 6. — Percentage of public school teachers perceiving issues as serious problems 
affecting their schools 
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Mismanagement Undermines Learning 

The inability of the District's schools to effec- 
tively implement long-term education and opera- 
tional plans leaves students without teachers or 
classrooms, textbooks unordered or lost in ware- 
houses, teachers untrained and uncertified, and stu- 
dents who are disabled without access. Addition- 
ally, poor resource allocation distorts priorities, en- 
suring that educational needs go unmet even when 
funds are available. 

Two critical areas that are mismanaged are per- 
sonnel and procurement. DCPS' personnel opera- 
tions are in disarray. Every aspect is problematic: 

• inability to identify how many employees 
work in the schools; 

• lack of adequate administrative controls; 

• incomplete planning, and 

• inability to understand or relate the impact 
of personnel on the educational mission. 

Procurement is just as bad. Poor contract man- 
agement has left an indelible mark on the District's 
children — who, among other things, have been 



forced to eat cold cereal for lunch and have been 
subjected to unqualified individuals operating 
school facilities. 

Basic data are also not available. The most criti- 
cal is that the District public school system does not 
know how many students it has. Estimates vary 
between 65,000 and 81,000 students; the discrepancy 
is alarming. NCES found a discrepancy of 20.6 per- 
cent between the 1990 census and the number of stu- 
dents reported by DCPS (figure 9). This discrep- 
ancy is significantly greater than any of the other 
major jurisdictions in the Greater Washington Met- 
ropolitan area. 

Lack of basic data on students and employees 
makes budgeting next to impossible. 

Financing Education in the District 
of Columbia Public Schools 

The District has a unique financing arrangement. 
Not only must the District carry out the roles of a 
city a county and a state, but it's budget must be 
directly approved by Congress. The result is confu- 
sion and, in terms of Congress, a level of meddling 
in local affairs that no other jurisdiction in the coun- 



Figure 7.— Percentage of high school teachers and students reporting violence and safety 
issues 
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SOURCE: Data Volume for the National Education Goals Report (Volume Two: State Data). 
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Figure 8. — Percentage of schools with inadequate features 
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SOURCE: GAO Reports — School Facilities: Profiles of School Conditions by State; Condition of America’s Schools. 




Figure 9. — Discrepancy between public school enrollments as reported by school systems 
and by 1990 census 
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try would permit. Frequently the congressional 
members attach all types of stipulations to the 
District's budget. In no area of the budget has this 
involvement been more than in education. Congress 
has mandated charter schools, English literacy re- 
quirements, public charter schools, a world class 
schools task force, per capita school funding, and 
the establishment of a commission on consensus 
reform. Public education budgeting in this environ- 
ment is clearly a challenge. 

Although the former Superintendent and the 
former President of the Board of Education argued 
that more dollars are needed to support the District's 
schools, it remains to be determined whether or not 
additional operational funding is needed in light of 
an assessment of per capita spending. Funding for 
the District's public schools comes from local funds, 
and federal and private grants. Funding peaked in 
Fiscal Year (FY) 1994 and has declined slightly since 
then. Figure 10 shows funding over the past 10 years 
in constant 1996 dollars. 



Comparisons of the District's school expendi- 
tures with other jurisdictions are difficult because 
of the many ways in which expenditures are re- 
ported. However, the District's average cost per stu- 
dent clearly exceeds the national average, and it is 
also substantially higher than many comparable ur- 
ban school districts and neighboring districts. Fig- 
ure 11 depicts these comparisons. 

Even though spending per student in the Dis- 
trict is high, much of that funding is not reaching 
the classroom. The 1992 report of the DC Commit- 
tee on Public Education (COPE) found that DCPS 
spends "less than other districts on instruction and 
more on central administration and overhead." Four 
years later, DCPS expenditures toward classroom in- 
struction continued to lag behind that of its neigh- 
bors. 

Figure 12 compares DCPS' instructional spend- 
ing to the Washington suburban jurisdictions of 
Fairfax and Montgomery counties. 



Figure 10. — District of Columbia public schools’ funding from local, federal, and other 
funds (in constant 1996 dollars): FY 1985-95 



Funding 




Year 

NOTE: All funds are adjusted to 1996 dollars based on consumer price indexes for major expenditure classes. 

SOURCE: District of Columbia Public Schools Comprehensive Annual Financial Report, 1995; DCPS Component Unit 

Financial Report, 1994. 
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Figure 11. — Average per-pupil expenditures: 1994-95* 



Per pupil expenditures 




*These expenditures for the District do not include any cost of financing capital expenditures or the costs of the teacher 
pension program. It is unclear whether other jurisdictions 1 average cost per student includes such costs. However, these 
costs would add another $1,250 per student in the District. 



NOTE: DC expenditures per pupil assume 80,450 students and total expenditures of $615.8 million and includes food 
services, continuing education, and other Federal education costs. 

SOURCE: FY 1996 Budgets; National Center for Education Statistics. 



Figure 12. — Comparison of District of Columbia public schools’ (DCPS) instructional and 
non-instructional spending to Washington suburban jurisdictions 



DCPS Spending Priorities 




Neighboring Districts Spending Priorities 
(Montgomery and Fairfax) 
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SOURCE: FY 1997 Budgets. 
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DCPS' large central administration takes away 
from instructional spending. DCPS' teacher to cen- 
tral administrator ratio highlights the size of its ad- 
ministration. Figure 13 depicts the fact that DCPS 
employs 16 teachers for every central administrator 
employed, compared with its peers who employ 42 
teachers for every central administrator employed. 

In 1996, DCPS allocated more toward its Office 
of the Superintendent than the Fairfax County, Mont- 
gomery County, and Baltimore City public school 
systems combined. DCPS exceeded its neighbors' 
average allocation by $4.55 million and its peers' by 
$3.38 million (figure 14). 

DCPS spent more than twice as much on its Of- 
fice of the Board of Education than peer and neigh- 
boring district average. In FY 1996, DCPS allocated 
over $1.4 million to the Office of the Board of Edu- 
cation — more than three times the average $454,000 
of neighboring Fairfax and Montgomery counties 
(figure 15). DCPS even allocated $203,000 more than 
Chicago Public Schools (CPS), despite the fact that 
the CPS board was responsible for overseeing a dis- 
trict with 400,000 students — five times as many as 
DCPS. 



Authority Actions and Progress 

With the deplorable conditions outlined in our 
report, the Authority believed that strong action 
needed to be taken. Therefore on November 15, 
1996, the Authority, through use of powers granted 
by Congress, ordered the removal of the Superin- 
tendent and reduced the powers of the elected 
Board of Education. In their place, the Authority 
installed a retired Army Lieutenant General, Julius 
Becton, and established an Emergency Transitional 
Education Board of Trustees made up of experi- 
enced leaders with the ability to instill a vision and 
lead a large organization in the direction of that vi- 
sion until June 2000. 

The Authority also ordered the Trustees and the 
CEO to develop specific measures that will be used 
to gauge their success in meeting the following 
goals: 

• enhancing the quality of education at all Dis- 
trict schools; 

• improving student participation, perfor- 
mance, and outcomes; 
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Figure 14. — Office of the Superintendent (or equivalent) adopted budget (in thousands of 
dollars): 1996 



Budget amount 




DC Neighboring Fairfax Montgomery Peer Baltimore Chicago Cleveland Milwaukee Newark 
average County County average 



SOURCE: School District Budgets or Comprehensive Annual Financial Reports, FY 1996. 



Figure 15. — Board of education (or equivalent) adopted budget (in thousands of dollars): 
1996 
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• improving the quality, effectiveness, and effi- 
ciency of school system management; 

• creating internal accountability and focus on 
customer satisfaction; 

• improving the physical conditions of school 
properties; and 

• reducing incidences of crime, especially of- 
fenses that threaten student and staff safety. 

Within months of instituting the new structure. 
General Becton and the Board of Trustees found that 
as bad as the Authority's report painted the District's 
school system, the true situation was even worse: 

• stacks of unpaid bills piled in boxes; 

• personnel actions that had not been processed 
for months, even years; 

• a special education program that was com- 
pletely broken; 

• numerous teachers who were not certified to 
teach; 

• several personnel on the payroll who had 
died; 

• no plans for facilities improvements; and 

• a maintenance contractor who, according to 
the District's Inspector General, had over- 
charged the struggling school system by more 
than $6 million over the past 3 years. 

To correct these problems, the new management 
team has: 

• hired an academic officer, an operating officer, 
a human resource officer, and a chief finan- 
cial officer; 

• fired budgeting and finance personnel; 

• updated all personnel actions and imposed a 
hiring freeze; 

• developed a performance measurement sys- 
tem; 

• developed a facilities plan and abated more 
than 1,600 fire code violations; 



• closed 11 schools and replaced approximately 
50 roofs; 

• painted and spruced up over 40 schools (vol- 
unteers); 

• increased security with new metal detectors 
and increasing training for security guards; 

• established a teacher evaluation program; 

• proposed new academic standards that end 
social promotions and require passing stan- 
dardized reading and mathematics tests be- 
fore progressing to certain key grades and a 
C average before graduating from high 
school; 

• terminated a large school maintenance con- 
tract; 

• awarded new contracts to provide better 
school breakfast and lunches; and 

• provided closer coordination and interface 
with city personnel and financial manage- 
ment systems to record the school's employ- 
ees and budget expenditures more accurately. 

These are obviously just a beginning, and there 
have been some slips along the way such as the 3- 
week delay in school opening to finish roof replace- 
ment. General Becton and his chief operating of- 
ficer have recently resigned. The chief academic 
officer replaced General Becton. But true reform, 
particularly in the academic area, is a long-term task 
and cannot be achieved overnight. The Authority 
will be closely monitoring the progress on the new 
management team to ensure that significant progress 
is made. 

Higher Education in the District of 
Columbia 

Public higher education in the District is princi- 
pally provided by the University of the District of 
Columbia (UDC). UDC is unique in that it is the 
only urban land grant institution of higher educa- 
tion in the United States. UDC has an open enroll- 
ment policy and about 4,500 full-time equivalent stu- 
dents. 
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As a part of the Authority's efforts to revitalize 
public education in the District, we have undertaken 
a comprehensive review of higher education services 
in the District of Columbia, specifically UDC. UDC 
has had a difficult time adjusting to reductions in 
public funds, which currently are about 50 percent 
of the UDC budget. Our review of UDC found: 

• UDC has never had a clear mission; 

• the role of the faculty in the shared gover- 
nance model has never been sufficiently de- 
tailed; 

• operations and management functions have 
failed to provide efficient or effective services 
in support of the academic, community, or re- 
search mission of the institution; 

• academic programs are not focused in meet- 
ing student or market needs; and 



• UDC's allocation of resources is misdirected. 

In summary, the Authority has recognized that 
if the District's quality of life and government is truly 
going to get better, we must fully invest our re- 
sources and efforts toward providing a better-than- 
adequate education for students in the District of 
Columbia school system. In making an investment 
in the world class education our students deserve, 
we will find what other systems have found — a bet- 
ter functioning school system will offer our youth 
hope for the future, a better opportunities for life- 
time employment, and increased awareness of life's 
opportunities. In doing so we will find, just as other 
cities have found, that as we invest in education, 
the need for human service dollars will decrease as 
our students are better educated and are better pre- 
pared for a productive life. 
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Introduction 

Do expenditures on school resources have a 
positive effect on student outcomes? This question 
is important to many audiences: parents of school- 
aged children; citizens concerned about the effec- 
tiveness of their tax dollars; educators trying to im- 
prove student outcomes; and state policymakers 
charged with developing fair school finance formu- 
las. Despite thirty years of research by economists, 
sociologists, and educational researchers, beginning 
with the Coleman Report (1966), this question still 
has no definitive answer. 

Most economic analyses take an "educational 
production function" approach. These studies use 
econometric techniques to relate educational out- 
comes (e.g., students' academic achievement) to 
school inputs while controlling for other contribu- 
tions such as those of the students themselves, their 
families, peers, and communities. Within this broad 
framework, educational production function stud- 
ies exhibit a wide range of empirical approaches. 1 
They vary in their choice and measurement of edu- 



cational outcomes, explanatory variables of inter- 
est, and control variables. They also differ in their 
geographical scope and their unit of analysis. 

Findings from these studies are as mixed as their 
empirical approaches are varied. Some studies es- 
timate large, positive effects of school inputs on stu- 
dent outcomes; others find little or no effect; still 
others conclude that additional school resources are 
inversely related to student outcomes. The most 
well-known result of this vast literature is 
Hanushek's (1986, 1989) conclusion of "no strong 
or systematic relationship between school expendi- 
tures and student performance." Hanushek's find- 
ing is based on his syntheses of more than thirty 
separate educational production function studies. 2 
A more recent synthesis by Hedges, Laine, and 
Greenwald (1994) challenges the validity of the ana- 
lytical method of "vote counting," employed by 
Hanushek. Using the same primary studies as 
Hanushek's 1989 analysis, but a more sophisticated 
synthesis methodology known as "meta-analysis," 
Hedges, Laine, and Greenwald reach the opposite 



1 The many approaches of educational production function studies are reviewed by Hanushek (1979, 1986), Cohne and Geske 
(1990), and Monk (1992). 



2 Hanushek's famous 1986 analysis in the Journal of Economic Literature includes 147 regressions from 33 separate education 
production function studies. His updated 1989 study in Educational Researcher includes 187 regressions from 38 primary 
O studies. He reports the exact same conclusion in the two synthesis studies. 
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conclusion. 3 They find a statistically significant and 
economically substantial, positive relationship be- 
tween school inputs and student outcomes. 

The relevance of the findings from these syn- 
theses depends not only on the quality of their meth- 
odological approaches but, more importantly, on the 
quality of the primary research studies. In review- 
ing the primary studies considered in these synthe- 
ses, I find that none of the primary studies ad- 
equately accounts for across-district variations both 
in the resource costs of educational services (nota- 
bly teacher compensation) and in the proportion of 
students with special needs, who require additional, 
more costly services. 

These variations in resource costs and student 
needs are significant. The power of school districts 
to purchase a standard " market basket" of educa- 
tional resources varies by twenty to forty percent 
within states and as much as forty percent across 
states (Chambers, 1981; McMahon, 1995). Student 
needs vary widely across districts as well, with the 
proportion of special-needs students approaching 
fifty percent in some large urban school districts 
(Odden & Picus, 1992). I expect that a stronger rela- 
tionship between student achievement and school 
expenditures will emerge after accounting for these 
resource-cost and student-need differentials. 

To test this hypothesis, I use a unique data set 
merged from three high quality, national data 
sources: the National Education Longitudinal Study 
of 1988, the Common Core of Data, and a district- 
level teacher cost index. 4 I specify and estimate a 
value-added student achievement model for which 
my explanatory variable of interest is per-pupil ex- 
penditures. I find that the estimated effects of per- 
pupil expenditures on high school students' aca- 
demic achievement are consistently positive and 



statistically significant. However, these effects do 
not increase appreciably when the measure of ex- 
penditures is corrected to account for resource-cost 
differentials or when differences in the proportions 
of special-needs students are taken into account. 

The remainder of this paper is organized as fol- 
lows: I first present my conceptual model and de- 
scribe the data sources, sample, and variables used 
in my empirical analysis. Next, I explain how I con- 
ducted my estimations and present and discuss the 
results. Lastly, I summarize my findings and pre- 
sents suggestions for future research. 

Conceptual Framework 

Educational Production Function Studies 

My conceptual model is the basic value-added, 
reduced-form specification of the educational pro- 
duction function presented in Hanushek's (1979, 
1986) reviews. The educational outcome of interest 
is academic achievement. An individual student's 
achievement at time t (A t ), is modeled as a function 
of the student's prior achievement (A t J, other stu- 
dent characteristics and effort (I), and the influences 
of the student's family (F), peers (P), school (S), and 
community (C) during the period between t* and t. 
That is, 

A t j(A.^ F P ^ t-t* ^ t-t 

The effects of the school inputs on achievement 
are of primary interest in educational production 
function analyses. The types of school inputs con- 
sidered in these analyses depend on the policy ques- 
tions being addressed. Studies that focus on how 
schools allocate their funds typically consider 
teacher/ pupil ratios, and teachers' education levels 
and years of experience as the school inputs. My 
policy interests involve the equity of school finance 



3 Hanushek's analytical method of "vote counting" examines only the sign and level of statistical significance of the estimated 
effects of the seven different school inputs on student performance. He gives one "vote" to each estimated effect with a 
positive sign. Whether he considers only those effects that are statistically significant or he ignores statistical significance, 
Hanushek concludes that the proportion of positive effects is too small to indicate a strong relationship between school inputs 
and student performance. 

Hedges, Laine, and Greenwald's "meta-analysis" considers not only the signs but also the magnitudes of the estimated effects 
of school inputs on student outcomes. Additionally, their more sophisticated methodology accounts for dependence among 
regressions estimated within the same study using slightly different empirical specifications and among regressions in different 
studies that used the same data sources. 

4 The teacher cost index was developed by Jay Chambers of the American Institutes for Research, and like the other data sources, 
was released by the U.S. Department of Education's National Center for Education Statistics. 
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formulas; hence, I consider schools' fiscal resources 
as the school input of interest. 

The efforts of states to provide more equitable 
educational opportunities and student outcomes by 
reducing across-district disparities in schools' fiscal 
resources inspired my two primary research ques- 
tions: 1) Is there a positive, systematic relationship 
between student performance and schools' fiscal re- 
sources? and 2) How does the strength of that rela- 
tionship depend on the precise measure of fiscal re- 
sources? Specifically, is the relationship between 
student achievement and per-pupil expenditures 
(PPEs) stronger when the PPE measure reflects the 
costs of educational services and the population of 
special-needs students? If this is the case, then states 
would be more likely to achieve their student eq- 
uity objectives by attempting to equalize not nomi- 
nal per-pupil expenditures, but rather per-pupil ex- 
penditures adjusted for costs and student needs. 

Variations in Costs 

One problem in educational pro- 
duction function studies that link 
schools' fiscal resources to student 
outcomes is that the costs of equiva- 
lent educational services vary widely 
across districts. Researchers estimate 
that these costs vary by twenty to 
forty percent within states and up to 
forty percent across states (Cham- 
bers, 1981; McMahon, 1995). In stud- 
ies that ignore such differential re- 
source costs, disparate outcomes for 
districts with identical expenditure 
levels seemingly lend support to the 
notion that money does not matter. In fact, higher 
student achievement should be expected in low cost 
districts which, for the same nominal expenditure 
level, can purchase more or higher quality real re- 
sources than high cost districts can afford, all else 
being equal. 

One recent production function study does at- 
tempt to account for variations in education costs 
by location. William Sander (1993) adjusts his ex- 
penditure and income variables by a cost-of-living 



index developed by Walter McMahon (1988), and 
finds that teacher-related spending is positively re- 
lated to ACT scores in Illinois. Although Sander's 
study represents an improvement over the prior lit- 
erature, cost-of-living adjustments do not ad- 
equately account for educational price differentials. 

The cost of living is but one factor affecting the 
attractiveness of a school district as a place to live 
and work. Other characteristics — including the size 
of the school district, the types of students served, 
the crime rate, the level of pollution, the climate, 
access to medical facilities, availability of recre- 
ational opportunities, and consumption opportuni- 
ties — also affect the attractiveness of districts, and 
ultimately affect the salaries that are required to at- 
tract and retain individuals with specific profes- 
sional characteristics (Chambers, 1981). A cost-of- 
living adjustment fails to adequately account for 
variations in salaries of school personnel due to dif- 
ferences in job and regional char- 
acteristics. Since personnel costs 
comprise at least 80 percent of 
school expenditures and since 
variations in personnel costs 
dominate the pattern of cost dif- 
ferences across districts it is impor- 
tant to account for them (Cham- 
bers and Fowler, 1995). 5 

While a number of approaches 
have been taken in efforts to de- 
velop an index for personnel costs 
(see Chambers, 1981, pp. 45-52), 
Chambers argues that the most ap- 
pealing approach is based on the 
hedonic wage model. The theoretical framework, 
established by Lucas (1972), maintains that through 
a simultaneous process of matching the attributes 
of individual employees and the working conditions 
offered by employers, differential wages are deter- 
mined. In its application to the market for school 
personnel, hedonic wage theory recognizes that dif- 
ferences in the characteristics of school districts re- 
quire different salary levels to attract the types of 



. . . disparate out- 
comes for districts 
__ with identical expendi- 
ture levels seemingly 
lend support to the 
notion that money 
does not matter. 



5 Transportation and energy costs vary widely across districts as well, but account for a much smaller portion of schools' 
^ expenditures. 
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personnel needed to provide a given level and qual- 
ity of educational services across districts. 

The personnel index indicates the relative cost 
of employing workers with similar skills and jobs 
in different environments. The different environ- 
ments are characterized by district and regional fac- 
tors that are beyond the control of local school deci- 
sion-makers (Chambers, 1981, p. 63). 6 The types of 
district and regional factors considered reflect the 
overall quality of the environment within which the 
individual works and lives as well as the condition 
of the labor market in which prevailing wages and 
employment levels are determined. Thus, a per- 
sonnel cost index accounts for variations in district 
and regional characteristics, controlling for personal 
and job assignment characteristics. 

Adjusting expenditures by a personnel cost in- 
dex allows for more meaningful 
comparisons of PPE levels across 
districts that face different resource 
costs. We would expect that cost-ad- 
justed expenditures are better at 
capturing the quantity and quality 
of the educational services pur- 
chased, and that such "real" mea- 
sures should be more closely related 
to student performance than the 
typically considered "nominal" 
measures. 

Variations in Student Needs 

In educational production func- 
tion analyses for which the obser- 
vations are individual students, the ideal measure 
of a school's fiscal inputs would be the dollars (ad- 
justed to reflect resource costs) spent on each indi- 
vidual student. However, school expenditures are 
most accurately measured (and often only available) 
at the district level and are difficult to accurately 
allocate to schools, classrooms, or individual stu- 
dents. Hence, whether the unit of analysis is indi- 
vidual students, schools, or districts, most analyses 
that focus on fiscal resources simply use district-level 
PPEs — total district expenditures divided by the 



total number of students in the district — as the mea- 
sure for school inputs. Just as nominal expenditure 
levels make for poor comparisons across districts 
with different resource costs, simple PPEs make for 
poor comparisons across districts with different pro- 
portions of special-needs students. 

The distribution of special-needs students — in- 
cluding special education, compensatory education, 
and limited English proficiency (LEP) students — is 
not uniform across school districts. The incidence 
of students with physical and mental handicaps 
varies widely across states and districts. Large, ur- 
ban districts and small, rural districts tend to have 
higher proportions of students for whom English is 
not the primary language. Urban and rural areas 
also tend to serve a higher proportion of students 
living in poverty (Odden and Picus, 1992). The costs 
of providing services to these special-needs students 
vary depending on such factors as 
the number and types of students 
with special needs, the size of the 
school, and the kinds of services 
provided. In general, though, stud- 
ies estimate that special education 
programs are about 2.3 times as 
costly as regular programs (Kakalik 
et al., 1981; Moore, Strang, 
Schwartz, and Braddock, 1988; 
Chaikind, Danielson, and Brauen, 
1993), and compensatory and LEP 
programs are at least 20 percent 
more costly (Odden and Picus, 
1992; Parrish, Matsumoto, and 
Fowler, 1995). 

A variety of federal and state aid programs are 
designed to help districts offset the additional costs 
of providing extra services for special-needs stu- 
dents. Under Chapter 1 of the Elementary and Sec- 
ondary Education Act (ESEA), the federal and state 
governments provide extra funds to districts for 
compensatory education. Title VII of the ESEA 
makes funds available for bilingual education pro- 
grams. The federal Education for All Handicapped 
Children Act mandates and helps fund special edu- 



Just as nominal 
expenditure levels 
make for poor compari- 
sons . . . with different 
resource costs, simple 
PPEs make for poor 
comparisons across 
districts with different 
proportions of special- 
needs students. 



6 It is essential that adjustments for differential costs of education be based only on factors which are beyond the control of 
district decision-makers, so that inefficient spending practices are not encouraged. 
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cation programs. Analyses of expenditures that in- 
clude these additional funds should also reflect the 
size of the special needs population for whom these 
funds are provided. 

Because the distribution of special-needs stu- 
dents varies widely among school districts, simple 
comparisons of PPEs across districts fail to reflect 
differences in school resources available for the av- 
erage student. Districts with smaller proportions 
of the more costly special-needs students, in effect, 
have more money to spend on the average student 
than do schools with higher proportions of these 
special-needs students, ceteris paribus. Hence, in 
educational production function studies relating 
school expenditures to student achievement, con- 
trol variables for the proportion of special-needs stu- 
dents in each district need to be included in the re- 
gressions. 

Hypothesis 

Figures 1-3, show how I expect 
these variations in resource costs and 
students' needs to affect the relation- 
ship between student achievement and 
school expenditures. Figure 1 is a styl- 
ized representation of Hanushek's con- 
clusion that there is no relationship be- 
tween student achievement and school 
expenditures. Figure 2 illustrates my 
hypothesis. I expect that districts with 
higher levels of student achievement 
and lower nominal expenditures (up- 
per left portion of graph) face lower 
costs of education and have relatively 
fewer special-needs students. Under these condi- 
tions, the adjusted measure of PPEs would be higher 
than the nominal measure. (The arrows represent 
the change in PPE measure from nominal to ad- 
justed.) Similarly I expect that districts with lower 
levels of student achievement and higher nominal 
expenditures (lower right portion of graph) face 
higher costs of education and serve a higher pro- 
portion of special-needs students. For these districts, 
the adjusted measure of PPEs would be lower than 
the nominal measure. If my expectations are cor- 
rect, then a (larger) positive relationship between 
student achievement and school expenditures 
should emerge as the measure of expenditures is 



adjusted to account for these differences in resource 
costs and student needs (see figure 3). 

Empirical Model 

Data Sources 

This study uses data merged from two large data 
sets and a smaller data file, each released by the 
National Center for Education Statistics. The first 
source is the restricted-use version of the National 
Education Longitudinal Study of 1988 (NELS), a 
general-purpose panel study that surveyed and 
tested eighth graders from about 1,000 public and 
private middle schools in the spring of 1988 and fol- 
lowed these students through high school. The first 
three waves of NELS include scores on cognitive 
tests administered to students in 1988, 1990, and 
1992 as well as information from questionnaires ad- 
ministered to students, their parents, teachers and 
school administrators over the 
same time period (Ingels et al., 
1994). 

The second source is the 
Common Core of Data (CCD), 
an annual, comprehensive data- 
base containing descriptive data 
on all public elementary and 
secondary schools and school 
districts in the United States. 
The CCD also contains en- 
hanced financial data at the dis- 
trict level for fiscal years 1990, 
1991, and 1992. Additionally, the 
CCD contains demographic in- 
dicators derived from special 
tabulations for school districts from the 1990 Cen- 
sus (National Center for Education Statistics, 1995). 

The third, smaller data source is a national, dis- 
trict-level teacher cost index (TCI) developed by Jay 
Chambers of the American Institutes for Research. 
Chambers' TCI reflects across-district variations in 
non-discretionary resource costs of teacher services. 
Based on a hedonic wage model, the TCI was cre- 
ated using survey data from over 40,000 public 
school teachers who participated in the NCES's 
Schools and Staffing Survey for school year 1990- 
1991. Chambers' TCI is the only nationwide, dis- 



Districts with smaller 
proportions of .. . 
special-needs stu- 
dents, in effect, have 
more money to spend 
on the average stu- 
dent than do schools 
with higher propor- 
tions of these special- 
needs students . . . 
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Figure 1. — With the traditional measure of per-pupil expenditure (PPE), no relationship 
between school expenditures and student achievement is evident 

Student achievement 



Higher 



Medium 



Lower 



Lower Medium Higher 

School expenditures (nominal PPE) 



SOURCE: Author’s illustration. 



Figure 2.— Adjusting expenditures to account for the cost of education and special-needs 
students may bring a new picture into focus 



Student achievement 
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Lower 



Lower cost of education and 
fewer special-needs students 
means that adjusted PPE is 
higher than nominal PPE. 



Higher cost of education and 
more special-needs students 
means that adjusted PPE is 
lower than nominal PPE. 



Lower Medium Higher 

School expenditures (nominal & adjusted PPE) 



SOURCE: Author’s illustration. 
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Figure 3. — A positive relationship between school expenditures, measured by adjusted per- 
pupil expenditure (PPE), and student achievement is expected to emerge 



Student achievement 




SOURCE: Author's illustration. 



trict-level index available that takes into account 
both the factors that underlie differences in the cost 
of living and variations in other teacher and school 
attributes that are within local control (Chambers 
and Fowler, 1995). Appendix A describes the con- 
struction of the TCI. 

Sample 

My sample is drawn from those students who 
participated in all the first three waves of the NELS 
panel study (16,489 students). I consider only stu- 
dents attending public schools (11,598) because they 
are the only ones to whom I can assign reliable, com- 
parable expenditure data from the CCD. 7 I further 
refine my sample to include only students who 
never dropped out of school (11,503) and who at- 
tended the same high school in both 1990 and 1992 
(11,167). 



These restrictions are imposed because I want 
to consider only those students who are consistently 
associated with school resources at particular 
schools. The disadvantage is that these students 
constitute a more stable student body than is re- 
flected in the total student population. To the ex- 
tent that dropout rates, transfer rates, or participa- 
tion in all three waves of the NELS survey are sys- 
tematically related to PPE levels, my findings are 
not generalizable to the entire student population; 
rather, they must be qualified to apply to this more 
stable group of students. 

I further eliminate observations with missing 
data in three critical areas: test scores, special-needs 
students, and TCI values. I lose a substantial num- 
ber of observations by considering only students 
with complete test score data in both 1988 and 1992; 
this restriction leaves 7,854 students. 8 EUminating 
observations lacking CCD data on the number of 
special-needs students and observations with miss- 



7 NELS oversampled students in private schools; hence, the large proportion of students who are eliminated given that I consider 
only students who attend public schools. 

R In this paper I do not tackle the potential "pretest to post-test selection problem" discussed by Becker and Walstad, 1990. 
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ing TCI values leaves a sample size of 6,990. Miss- 
ing values for some control variables reduce the 
number of observations used in the regression com- 
putations to 5,955. 9 

Variables 

The dependent variable in my regression equa- 
tions is the student's 1992 (senior year for most of 
the students) score on the NELS mathematics test. 
The specific measure I use for mathematics achieve- 
ment is the item response theory (IRT) theta score, 
which is standardized to a mean of 50 and a stan- 
dard deviation of 10. To eliminate floor and ceiling 
effects, three forms of the mathematics tests were 
administered to the students in 1992, depending on 
their prior achievement. Students who performed 
in the highest quartile on the 1990 test were given 
the most difficult version of the 1992 exam; those in 
the lowest quartile in 1990 received 
the easiest version of the 1992 exam; 
and the rest of the students received 
the test of medium difficulty in 1992. 

Item response theory was used to 
calculate scores that could be com- 
pared across test forms that differed 
across the years and across the stu- 
dents in a given year. The theta score, 
which is standardized across the 
three waves of testing is the best 
score to use when assessing gains in 
cognitive skills. (See Ingels et al., 

1994 for more information about 
NELS testing and IRT scoring.) 

The independent variables in- 
clude controls for achievement in eighth grade, in 
order to analyze the gam in cognitive outcomes dur- 
ing the high school years. I include both the 1988 
mathematics IRT theta score and the average 1988 



IRT theta score on the other three NELS tests — sci- 
ence, reading, and social studies — as control vari- 
ables. 10 I use the average of the other test scores as 
an additional control to reduce bias from unmea- 
sured pre-existing differences among students (see 
Gamoran, 1996; Gamoran and Mare, 1989; and 
Jencks, 1985). I expect to find strong, positive rela- 
tionships between these measures of prior achieve- 
ment and the measure of achievement on the math- 
ematics test in 1992. 

Other control variables included in my empiri- 
cal analysis capture student and family characteris- 
tics, the student's interest and effort in mathemat- 
ics and in school, and characteristics of the student's 
peers, school, and community. Descriptive statis- 
tics for these control variables are reported in table 
l. 11 Definitions and sources for all the variables are 
provided in appendix B. 

Methodological 
Approach 

Recall that two primary ques- 
tions are addressed in this study. 
First, do these high quality, nation- 
wide data reveal a positive rela- 
tionship between student achieve- 
ment and PPEs? Second, is the es- 
timated effect of PPEs on student 
achievement strengthened by ac- 
counting for across-district varia- 
tions in resource costs and student 
needs? Addressing the first ques- 
tion is a straightforward matter of 
examining the statistical signifi- 
cance and substantive magnitude of the coefficient 
estimates on the PPE variables. Addressing the sec- 
ond question is more involved. 



... is the estimated 
effect of PPEs on 
student achievement 
strengthened by 
accounting for 
across-district varia- 
tions in resource 
costs and student 
needs ? 



9 Other fields with missing data include: the percentage of students in the district living in single-parent homes; the percentage 
of students in the district in minority families; historical dropout rates in the high school; and enrollment in the twelfth grade. 
In future studies, I intend to impute values for missing data in these fields. 

10 I use the 1992 math score as the dependent variable and include the 1988 math score as a control variable, rather than 
using the gain in score as the dependent variable, because the former specification is less restrictive. In particular, the 
gain score specification implicitly assumes that the coefficient on the 1988 math score should be one. Typically, the 
coefficient estimate on prior achievement in the same subject is in the range of 0.70 to 0.80. 

11 The means and standard deviations are weighted to account for the oversampling of certain populations in the NELS three- 
wave panel. The weight used in computing these descriptive statistics is the relative weight, F2PNLWT1 / mean (F2PNLWT). 
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Table 1 . — Descriptive statistics* 










Variable 


Mean 


Standard deviation 


Minimum 


Maximum 


Dependent variable 


Math score, 1992 


54.19 


10.03 


27.07 


80.6 


Explanatory variables 

Prior achievement 


Math score, 1988 


45.71 


8.36 


24.89 


67. 


Average of other scores, 1988 


46.22 


7.54 


25.89 


66.2 


Student and family characteristics 


Minority 


0.23 


0.42 


0.00 


1.0 


Female 


0.50 


0.50 


0.00 




Single-parent family 


0.14 


0.35 


0.00 


1.0 


Socioeconomic status 


0.01 


0.76 


-2.43 


1.9 


Student interest and effort 


Interest and effort in math 


2.56 


1.34 


0.00 


4.00 


Time spent on homework 


6.64 


3.34 


0.00 


16.00 


Class attendance 


3.29 


1.22 


0.00 


5.00 


Student's view of school environment 


Perceives disruptive environment 


0.85 


1.01 


0.00 


4.00 


Experiences disruptive environment 


0.96 


1.37 


0.00 


7.00 


Peers’ characteristics 


Peers from single-parent homes 


2.61 


0.78 


1 


5 


Percent minority students 


23.67 


29.51 


0 


100 


Peers’ absenteeism 


0.47 


0.50 


0 


1 


Peers’ dropout rates 


2.02 


1.53 


0 


6 


Special-needs students 


Percent special education 


9.65 


4.17 


0 


23.16 


Percent with limited English proficiency 


1.95 


3.15 


0 


25.20 


Percent below poverty level 


16.99 


11.27 


0.40 


66.20 


Community characteristics 


Percent adults w / at least some college 


45.86 


15.02 


10.80 


92.00 


Median income for households w / kids 


36,907 


13,083 


11,337 


114.544 


School characteristics 

Size 


Twelfth grade enrollment 


275 


168 


12 


1110 


Problems in school 


Composite of minor to serious problems 


8.09 


4.44 


0 


15 


Type 


Comprehensive school 


0.91 


0.29 


0 


1 


Magnet school 


0.10 


0.29 


0 


1 


Public school of choice 


0.34 


0.47 


0 




Year-round school 


0.04 


0.18 


0 


1 


Vocational-technical school 


0.09 


0.28 


0 


1 


Region 


Midwest 


0.33 


0.47 


0 


1 


Northeast 


0.13 


0.33 


0 




South 


0.34 


0.47 


0 


1 


West 


0.20 


0.41 


0 


1 


Urbanicity 


Suburban 


0.45 


0.50 


0 




Urban 


0.20 


0.40 


0 




Rural 


0.35 


0.48 


0 


1 



* Weighted to reflect population means. 



SOURCE: Author’s calculations using the National Education Longitudinal Study (NELS) and Common Core of Data (CCD) 
data. 7 
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Coefficient Comparisons Across Regressions 

To address the second question I run four main 
regressions then compare the coefficient estimates 
on the PPE variables across these regressions. The 
four regressions differ only in their measure of PPE 
and in their controls for special-needs students. I 
consider two measures of PPE: nominal and cost- 
adjusted. "Nominal PPE" is calculated by simply 
dividing the district's expenditures by the number 
of pupils in the district. "Cost-adjusted PPE" di- 
vides the nominal PPE value by the teacher cost in- 
dex (TCI) times 100. (The TCI is centered at 100 in 
the population rather than at one; hence the need to 
multiply by 100.) Additionally, I consider two al- 
ternative specifications of the model: in the first 
specification I do not control for the proportion of 
special-needs students; in the second specification, 

I do. In the second specification I include separate 
control variables indicating the 
proportion of students in each of 
the following special needs catego- 
ries: special education, limited En- 
glish proficiency, and compensa- 
tory education. The combination 
of the two alternative PPE mea- 
sures and the two alternative speci- 
fications produce the four distinct 
regressions. 

To examine the robustness of 
the results, I consider three alter- 
native categories of expenditures. 

The three expenditure categories 
are: 1) total district expenditures; 

2) core current expenditures; and 

3) expenditures on instructional salaries. The first 
category encompasses all current operation and 
capital outlay expenditures. The second includes 
just three key types of current operation expendi- 
tures: instructional expenditures (salaries and ben- 
efits for teachers and aides, contracted services, and 
supplies), pupil support services, and instructional 
staff support. The third category is the narrowest 
of all: only instruction-related salaries for teachers 
and aides are considered. Table 2 reports descrip- 
tive statistics for the nominal and the cost-adjusted 
PPE measures in each of these, three expenditure 
categories. 



To meaningfully compare the coefficient esti- 
mates across regressions, the nominal and cost-ad- 
justed PPE measures used in the regressions need 
to be on a common scale. Therefore, I create a new 
variable, called "comparable cost-adjusted PPE," by 
multiplying each observation of the "cost-adjusted 
PPE" by a constant factor. The factor equals the ra- 
tio of the mean nominal PPE to the mean cost-ad- 
justed PPE. The factor differs slightly across the 
three expenditure categories, but in all cases is ap- 
proximately 0.987. (Descriptive statistics for the 
"comparable cost-adjusted PPE" measure are also 
presented in table 2. Note that the means for the 
nominal and comparable cost-adjusted PPE vari- 
ables are identical by design.) It is the "nominal 
PPE" and the "comparable cost-adjusted PPE" vari- 
ables that are included in the regressions, thus al- 
lowing for meaningful across-regression compari- 
sons of the coefficient estimates on the PPE variables 
within each expenditure category. 

Within each expenditure cat- 
egory, I expect to find that the mag- 
nitude of the coefficient on the PPE 
measures increases: 1) as the mea- 
sure changes from "nominal PPE" to 
"comparable cost-adjusted PPE"; 2) 
when the regressions control for spe- 
cial-needs students; and 3) as both 
cost and student needs are taken into 
account (i.e., we move from nomi- 
nal PPE and no controls to cost-ad- 
justed PPE and special-needs con- 
trols). 

Estimation Results 

The results confirm that student achievement on 
the 1992 NELS mathematics test is positively related 
to per-pupil expenditures. This result holds for all 
three expenditure categories, whether the PPE mea- 
sure is nominal or cost-adjusted, and whether or not 
control variables for special-needs students are in- 
cluded in the regression. Table 3 summarizes the 
estimated effects of the various expenditure mea- 
sures on achievement for both model specifications. 
The coefficient estimate is consistently positive and 
statistically different from zero, though it is substan- 



. . . student achieve- 
ment on the 1992 
NELS mathematics 
test is positively 
related to per-pupil 
expenditures. 
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Table 2. — Descriptive statistics, alternative measures of expenditures 

Mean Standard deviation Minimum Maximum 



Measure 1: Total district expenditures 



Nominal per-pupil expenditure (PPE) 


5,577 


1,871 


2,895 


14,918 


Cost-adjusted PPE 


5,655 


1,621 


2,957 


15,346 


Comparable cost-adjusted PPE 


5,577 


1,599 


2,912 


15,134 


(Comparability factor: 0.9862) 

Measure 2: Core current expenditures 


Nominal PPE 


3,394 


1,176 


1,819 


9,277 


Cost-adjusted PPE 


3,434 


953 


1,746 


8,496 


Comparable cost-adjusted PPE 


3,394 


942 


1,726 


8,398 


(Comparability factor: 0.9884) 

Measure 3: Instructional salaries 


Nominal PPE 


2,245 


724 


1,086 


5,934 


Cost-adjusted PPE 


2,274 


580 


1,014 


5,500 


Comparable cost-adjusted PPE 


2,245 


573 


1,001 


5,428 



(Comparability (actor: 0,9870) 

SOURCE: Author’s calculations using the National Education Longitudinal Study (NELS) and Common Core of Data (CCD) 
data. 



tively small. 12 For example, the coefficient on nomi- 
nal core PPE in the regression that controls for spe- 
cial-needs students is 0.381 . This coefficient means 
that for an additional $1,000 in per-pupil expendi- 
tures, the math score is expected to increase by 0.381 
points over the four years of high school. Given 
that typical gain in math score is about 8.5 points, 
the extra $1,000 per pupil raises test scores by only 
4 percent of what is already expected. 

The results lend mild support for the hypoth- 
esis that accounting for differential resource costs 
and student needs would reveal a stronger positive 
relationship between student achievement and 
school expenditures. In table 3, 1 use a solid arrow 
to indicate changes in the magnitude of the coeffi- 
cient that are in the expected direction; broken ar- 
rows indicate changes in the unexpected direction. 



While the direction of change is as expected in 13 of 
15 cases, the magnitude of the change is minuscule 
compared to the standard errors. Indeed, the confi- 
dence intervals for the coefficients within each of 
the three expenditure categories almost entirely 
overlap. 

Although not of primary interest in this study, 
it is interesting to examine the effects of the other 
explanatory variables included in the model. These 
other effects may shed light on the weak effects of 
the fiscal resources. Table 4 presents all the estimated 
effects from the regressions that use (comparable) 
cost-adjusted core expenditures per pupil as the ex- 
planatory variable of interest. Performance on the 
1992 mathematics test is positively and statistically 
significantly related to prior achievement in both 
math and other subjects. Higher math achievement 



12 Because the NELS observations do not come from a random sample, the reported OLS estimates of the standard errors may be 
understated. Using a hierarchical linear modeling technique to account for the clustering of students within schools, I found 
that the HLM standard errors were virtually identical to the OLS standard errors. This result is not surprising, since there were 
only ten students, on average, in each school in 1992, and the magnitude of the bias for the standard errors increases with the 
average group size. (See Moulton, 1990, p. 335.) Other departures from random sampling (e.g., oversampling minorities) may 
also require the imposition of higher standards in judging statistical significance. (See Ingels, et al., 1994, pp. 42-53.) The root 
design effect for the full panel, when using the mathematics IRT score as the dependent variable, is 2.273. Multiplying the OLS 
standard errors by 2.273 will give a conservative standard error to use in judging statistical significance. Even imposing this 
most stringent standard for the standard errors, all the coefficients of the expenditure variables are statistically greater than 
zero at the 5 percent level of significance. 
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Table 3. — Comparison of effects of expenditures on 1992 math score 



Coefficients from regressions differing in per-pupil expenditure (PPE) measure and special-needs controls 

Model 1 Model 2 

No special needs controls With special needs controls 

Measure 1 : Total district expenditures 



Nominal PPE 


0.221 


* 0.214 




‘■°i 




(.051) 


Comparable cost-adjusted PPE 


0.226 


‘ • T 

— ► 0.231 




(.049) 


(.050) 


Measure 2: Core current expenditures 








Nominal PPE 


0.374 _ 


* 0.381 


( '°1 


,3> 


(.094) 


Comparable cost-adjusted PPE 


0.406 


— ^ 0.444 




(.096) 


(.097) 



Measure 3: Instructional salaries 

Nominal PPE 

Comparable cost-adjusted PPE 



0.633 -*0.630 




0.649 * 0.700 

(.163) (-166) 



NOTE: Standard errors are in parentheses. Solid arrows indicate that the coefficient change is in the predicted direction. 
Broken arrows indicate that the coefficient change is opposite the predicted direction. 



SOURCE: Author’s calculations using the National Education Longitudinal Study (NELS) and Common Core of Data (CCD) 
data. 
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is also positively and significantly related to higher 
socioeconomic status. Females' performance on the 
math tests is worse than males', and minorities' per- 
formance is worse than non-minorities'. Students 
from single-parent homes perform worse than those 
from two-parent households, but not significantly 
so. All three separate measures of student effort are 
positive and statistically significant. Students who 
experience multiple disruptions at school perform 
worse than those in less disruptive learning envi- 
ronments. The signs on most of the other non-ex- 
penditure-related explanatory variables are gener- 
ally as expected. The most notable unexpected re- 
sult is the negative coefficient on the median income 
for households with children. The effects of the PPE 
variable were highly sensitive to the inclusion or 
exclusion of this income variable, even though the 
correlation coefficient is only about 0.5. The posi- 
tive coefficient on the percent of LEP students in the 
regressions that used control variables indicates that 
limited English proficiency may not be a substan- 
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tial handicap on math tests. Indeed, international 
studies consistently rank U.S. school children among 
the lowest in math performance. Perhaps in schools 
with higher proportions of LEP students, the stu- 
dents are able to draw more from their prior math- 
ematics knowledge. In future analyses, I will con- 
sider performance in the other NELS subjects as 
well. I expect, for example, that the coefficient on 
LEP students will be negative on the reading test. 

Conclusions and Directions for 
Future Research 

This paper contributes to the understanding of 
the effects of school expenditures on student 
achievement by drawing on three nationwide data 
sets which are merged to create a rich sample for 
the empirical analysis. I expected to find (1) that 
the relationship between student achievement and 
nominal expenditures would be weak, and (2) that 
the relationship between achievement and cost-ad- 
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Table 4.— Regression estimates of effects on 1992 math score 






Explanatory variable of interest is cost-adjusted core current per-pupil expenditure (PPE) 

Model 1 

No special-needs controls 


Model 2 

With special-needs controls 


Explanatory variable 


Coefficient 


Std. error 


Coefficient 


Std. error 


Intercept 


5.497 3 


0.738 


5.467 3 


0.840 


Prior achievement 


Math score, 1 988 


0.753 3 


0.013 


0.750 3 


0.013 


Average of other scores, 1 988 


0.230 3 


0.014 


0.231 3 


0.014 


Student and family characteristics 


Minority 


-0.732 2 


0.219 


-0.754 2 


0.218 


Female 


-1.359 3 


0.136 


-1 .350 3 


0.135 


Single-parent family 


-0.341 


0.194 


-0.312 


0.194 


Socioeconomic status 


0.91 3 3 


0.107 


0.926 3 


0.107 


Student interest and effort 


Interest and effort in math 


0.354 3 


0.052 


0.360 3 


0.052 


Time spent on homework 


0.1 80 3 


0.021 


0.1 82 3 


0.021 


Class attendance 


0.480 3 


0.059 


0.471 3 


0.059 


Student's view of school environment 


Perceives disruptive environment 


-0.201 1 


0.073 


-0.205’ 


0.073 


Experiences disruptive environment 


-0.382 3 


0.055 


-0.379 3 


0.055 


Peers' characteristics 


Peers from single-parent homes 


0.135 


0.092 


0.179 


0.093 


Percent minority students 


0.01 4 2 


0.004 


0.007 


0.004 


Peers’ absenteeism 


-0.179 


0.136 


-0.136 


0.136 


Peers’ dropout rates 


-0.107 


0.047 


-0.110 1 


0.047 


Community characteristics 


Percent adults w / at least some college 


0.020’ 


0.008 


0.028 2 


0.009 


Median income, hholds w/ kids (000s) 


-0.029’ 


0.010 


-0.035 2 


0.011 


School characteristics 
Size 


Twelfth grade enrollment (00s) 


0.226 3 


0.053 


0.1 70 2 


0.055 


Problems in school 


Composite of minor to serious problems 


-0.Q52 1 


0.018 


-0.046 1 


0.018 


Type 


Magnet school 


-0.133 


0.244 


-0.140 


0.244 


Public school of choice 


-0.61 9 3 


0.141 


-0.600 3 


0.142 


Year-round school 


0.996’ 


0.363 


0.769’ 


0.371 


Vocational-technical school 


0.374 


0.254 


0.535' 


0.257 


Region (vs. Midwest) 


Northeast 


0.796 1 


0.271 


0.774 1 


0.271 


South 


-0.163 


0.180 


-0.039 


0.189 


West 


0.210 


0.221 


-0.055 


0.236 


Urbanicity (vs. Suburban) 


Urban 


-0.486 1 


0.227 


-0.515' 


0.230 


Rural 


-0.278 


0.175 


-0.223 


0.177 


Per-pupil expenditures 


Cost-adjusted core current PPEs (000s) 


0.406 3 


0.096 


0.444 3 


0.097 


Special-needs students 


Percent special education 


— 


— 


-0.025 


0.017 


Percent with limited English proficiency 


— 


— 


0.118 2 


0.035 


Percent below poverty level 


n = 5,955 
R-squared = .74 


— -0.001 
n = 5,955 
R-squared = .74 


0.013 



— Not applicable. 



1 Coefficient is twice its standard error. 

2 Coefficient is three times its standard error. 

3 Coefficient is four or more times its standard error. 

SOURCE: Author’s calculations using the National Education Longitudinal Study (NELS) and Common Core of Data (CCD) 
data. 
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justed expenditures would be stronger and positive, 
when controlling for the population of special-needs 
students. Instead, I consistently found a small posi- 
tive relationship that was relatively insensitive to 
the cost-adjustments and special-needs controls. 
These results provide evidence that the lack of a 
strong relationship between student achievement 
and school expenditures cannot simply be attributed 
to mismeasurement of the schools' fiscal resources. 

In future research I intend to test the robustness 
of these results. I will consider alternative model 
specifications and methods of accounting for dif- 



ferential resource costs and student needs. It may 
be that I find no support for my hypothesis no mat- 
ter which model or adjustment factors are used/but 
given the dearth of work in this area, further explo- 
ration is warranted. I will examine the degree to 
which my results are due to assumptions linearity 
of the model's functional form. I will also examine 
the extent to which these results are dependent on 
my choice of cost-adjustment: Chambers' TCI. These 
and other avenues of exploration should shed fur- 
ther light on the potential effectiveness of school fi- 
nance reform in affecting student equity. 




f * 
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Appendix A. Teacher Cost Index 13 

The theoretical basis for Chambers' teacher cost 
index, (TCI) is the hedonic wage model. In this 
model, teachers care about both the quality of their 
work environment and the monetary rewards asso- 
ciated with particular employment opportunities. 
School districts care about the characteristics of their 
workers and the costs of hiring those workers. The 
hedonic wage model assumes that the simultaneous 
matching of teachers with school districts reveals 
the differential rates of pay associated with em- 
ployee attributes and working conditions offered by 
employers. Thus, the model allows for decomposi- 
tion of observed variations in wages into the im- 
plicit dollar values attached to each unit of the per- 
sonal and workplace characteristics. 

Chambers represents the reduced form of the 
hedonic wage model for teacher salaries as: 

(Al) ln(S ALARY =a + J3D + J3 R R. + J3 t T. + 
&C.+ As.+k.. 

"C i *S i ij 

where i indexes individual teachers and j indexes 
school districts. The dependent variable is the natu- 
ral logarithm of the annual earnings of the teacher 
from the school district. The explanatory variables 
can be divided into two broad categories: cost fac- 
tors and discretionary factors. The cost factors in- 
clude district (D) and regional (R) attributes that 
affect the willingness of teachers to live and work 
in these localities and that are beyond the control of 
local decision makers, e.g., competition in the mar- 
ket for teachers, factors underlying cost-of-living 
differences, amenities of urban and rural life, cli- 
matic conditions, racial-ethnic mix of students, and 



district size and growth. These cost factors are di- 
rectly used in calculating the TCI. The other cat- 
egory of explanatory variables used in the hedonic 
wage model includes discretionary factors — those 
within the control of local school district decision 
makers in the long run, such as the characteristics 
of the individual teachers (T), the attributes of the 
job or classroom to which they are assigned (C), and 
various school characteristics (S). These discretion- 
ary factors are included as control variables in the 
regression to eliminate their contribution to expen- 
diture differences across districts. (See table 1.1 of 
Chambers and Fowler, 1995, for details of the spe- 
cific variables included under each of these catego- 
ries.) 

The data used in the empirical estimation of this 
model are derived primarily from the Schools and 
Staffing Survey (SASS). They include responses 
from 46,750 public school teachers in 8,969 public 
schools and 4,884 public school districts. These data 
are supplemented by data from the Common Core 
of Data, the Census Bureau, the U.S. Geological Sur- 
vey, and the National Climatic Data Center. 

After estimating equation Al, a teacher cost in- 
dex is calculated for each school district based on 
the estimated coefficients and values of the cost fac- 
tors, while controlling for variations in the discre- 
tionary factors. The TCI for each school district; is 
calculated as: TCI. = exp[ (3 D (D. - D) + /3 r (R. - R) ]. 

The overall mean value for the TCI is 100. The 
index is greater than 100 for districts facing higher 
non-discretionary costs (e.g., the average TCI for dis- 
tricts in New York City is 130) and is less than 100 
for districts in low cost areas (e.g., the average TCI 
for districts in non-metropolitan Oklahoma is 80). 



This summary of the TCI draws heavily from Chambers and Fowler, 1995. 
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Appendix B. Definitions and 
Sources of the Variables 

Unless otherwise noted, the variables described 
below are based on variables from the NELS Stu- 
dent Component Data Files. Other sources of data 
include the NELS School Component Data Files 
(NELS School), the Common Core of Data (CCD), 
and the Teacher Cost Index (TCI). 

Dependent Variable 

Math score, 1992: Score on the mathematics 
achievement test in the spring of 1992, when most 
of the students were in twelfth grade. Uses NELS 
variable F22XMTH, the IRT Theta t-score. (See Ingles 
et al., 1994, p. H-33 for a description of the benefits 
of using this metric.) 

Explanatory Variables of Interest 

Six variables measuring per-pupil expenditures 
are used in these analyses. These are based on three 
categories of expenditures (total, core current, and 
instructional salaries) and two alternative calcula- 
tions of PPEs (nominal and cost-adjusted). 

The three categories of expenditures are from 
the CCD for Fiscal Year 1992 (School Year 1991-92). 
Expenditures are measured for the entire school dis- 
trict. 

• Measure 1 is total district expenditures, field 

CJTOTEXR 

• Measure 2 is core current expenditures, de- 
fined as instructional expenditures, pupil sup- 
port services, and instructional staff support: 
C_E13 + C_E17 + C_E07. 

• Measure 3 is instructional salaries only, 
C_Z33. 

The two methods of calculating PPEs are de- 
scribed below: 

• Nominal PPEs are calculated by simply di- 
viding each of the expenditure measures de- 
scribed above by the total number of students 



in the school district in School Year 1991-92 
(AG_PK12). For example, the formula for per 
pupil total expenditures is C_TOTEXP/ 
AGJPK12 . 

• Cost-adjusted PPEs are calculated by divid- 
ing expenditures by Chambers 7 teacher cost 
index (TCI) multiplying by 100, then divid- 
ing by the number of students in the district, 
e.g., (C_TOTEXP/TCI*100)/AG_PK12. 

Note that the cost-adjusted measure that is used 
in the regressions is rescaled to be comparable to 
the nominal measure within each category. See 
"Coefficient Comparisons Across Regressions." 

Control Variables 
Prior Achievement 

• Math score, 1988: BY2XMTH , eighth grade 
IRT Theta t-score. 

• Average of other scores, 1988: Average of 
1988 IRT Theta t-scores in reading, science, 
and social studies. (BY2XHTH + BY2XSTH + 
BY2XRTH) / 3. All these test scores are on the 
same metric; hence the simple average score 
is appropriate. 

Student and Family Characteristics 

• Minority: Student's race based on F2RACE1, 
recoded to l=Black, Hispanic, or Native 
American; 0=White or Asian. 

• Female: Student's sex based on F2SEX , 
recoded to l=female; 0=male. 

• Single-parent family: Adult composition of 
the student's household based onFAMCOMP, 
recoded to l=adult female only or adult male 
only; 0=two parents or guardians. 

• Socioeconomic status: F2SES1 , SES measure 
based on father's education level, mother's 
education level, father's occupation, mother's 
occupation, and family income, and using 
Duncan's Socioeconomic Index (1961). 
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Student Interest and Effort 

• Interest and effort in math: Composite vari- 
able based on the student's responses to ques- 
tions F2S21A-D: In your current or most re- 
cent math class, how often do you: 

— Pay attention in class? 

— Complete your work on time? 

— Do more work than was required of you? 

— Participate actively in class? 

Composite ranges from 0 (little effort) to 4 
(strong effort). 

• Time spent on homework: Sum of categori- 
cal data on hours spent on homework in 
school (F2S25F1 ) and out of school (F2S25F2). 
Sum ranges from 0 indicating no time to 16 
indicating over 40 hours per week. 

• Class attendance: Composite variable (uses 
F2S9A-F) measuring the student's attendance 
in classes, based on how often the student re- 
ports he or she: 

— Was late for school. 

— Cut or skipped class. 

— Missed a day of classes. 

— Was put on in-school suspension. 

— Was suspended or put on probation from 
school. 

Composite ranges from 0 to 5, where 5 indi- 
cates the student says he or she "never" did 
any of the above. 

Student's View of the School Environment 

• Perceives disruptive environment: Compos- 
ite of the student's perception of the school's 
learning environment, based on how strongly 
the student agrees with statements F2S7E-H: 

— I don't feel safe at this school. 

— Disruptions by other students get in the 
way of my learning. 

— Fights often occur between different racial 
or ethnic groups. 



— There are many gangs in school. 

Composite ranges from 0 to 4, where 4 means 
the student agreed or strongly agreed with 
all four statements. 

• Experiences disruptive environment: 
Composite measuring the student's personal 
experiences that indicate a disruptive learn- 
ing environment. The composite ranges from 
0 to 7 and indicates the number of affirma- 
tive responses to statements F2S8A-G: 

— I had something stolen from me at school. 

— Someone offered to sell me drugs at school. 

— Someone offered to sell me drugs on the 
way to or from school. 

— Someone threatened to hurt me at school. 

— Someone threatened to hurt me on the way 
to or from school. 

— I got into a physical fight at school. 

— I got into a physical fight on the way to or 
from school. 

Peers' Characteristics 

(All these variables are based on data from the 
NELS School File) 

• Peers from single-parent homes: F2C23, es- 
timate by school administrator of the percent 
of twelfth graders (in 1992) from single-par- 
ent homes. Coding: 1 indicates less than 10 
percent from single-parent homes; 5 indicates 
more than 75 percent. 

• Percent minority peers: Percentage of twelfth 
graders who are Black, Hispanic, or Native 
American. F2C22B + F2C22C + F2C22E. 

• Peers' absenteeism: Based on F2C21, aver- 
age daily attendance (ADA) rate for twelfth 
graders, recoded such that 0 indicates 95 per- 
cent ADA; 1 indicates 90 percent ADA < 95 
percent; 2 indicates 85 percent ADA < 90 per- 
cent; 3 indicates ADA < 85 percent. Peers' 
dropout rate: Based on F2C26, estimate of the 
percent of students who enter the twelfth 
grade who drop out before graduation. 
Coded such that 0 means none drop out; 1 
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means 0 percent dropout rate (DR) < 3 per- 
cent; 2 means 3 percent DR < 5 percent; 3 
means 5 percent DR < 7 percent; 4 means 7 
percent DR <10 percent; 5 means 10 percent 
DR < 20 percent; and 6 means 20 percent DR. 

Special-needs students 

(From the CCD Agency Database for School Year 

1991-92) 

• Percent special education: AG_SPED/ 
AG_PK12*100, number of special education 
students in the district divided by the total 
number of students in the district, times 100. 

• Percent with limited English proficiency: 
P7028TP, percentage of children in the district 
who speak English "not well." 

• Percent below poverty level: P7118TP, per- 
centage of children in the district living be- 
low the poverty level. 

Community Characteristics 

(From the CCD Agency Database for School Year 

1991-92) 

• Percent adults with at least some college: 
P120403P + P120404P, percentage of adults in 
the district with some college, or a bachelor's 
degree or higher degree. 

• Median income for household with kids: 

P3080A0 1. 

Size of Class; Problems in School 

(From the NELS School File) 



abuse of teachers, racial/ ethnic conflicts, and 
teen pregnancy. . 

School Characteristics 

(From the NELS School File) 

In the NELS School File, public schools are clas- 
sified as the following types: 

• Comprehensive school (not including mag- 
net school or school of choice); 

• Magnet school (including schools with mag- 
net programs, schools within a school); or 
School of choice (open enrollment/ non-spe- 
cialized curriculum). 

For each of the three types of schools, I as- 
sign a 1 if the administrator indicated that the 
school met the characteristics of that type of 
school and a 0 if not. Although the definition 
of comprehensive schools specifically ex- 
cludes magnet schools or schools of choice, 
the data reveal that some administrators in 
magnet schools and/ or schools of choice 
marked that they were also comprehensive 
schools. In my regression analyses I do not 
include a variable for comprehensive schools; 
I do include dummy variables for magnet 
schools and schools of choice. 

Zero-one dummy variables are also included for 
two other characteristics of schools: 

• Year-round schools; and 

• Vocational-technical schools. 



• Twelfth grade enrollment: Enrollment of 
twelfth graders as of Oct. 1991, based on F2C2. 

• Problems in school: Composite of school 
problems as judged by the school adminis- 
trator (using NELS variables F2C57A,C-P). 
Composite ranges from 0 to 15, where higher 
values indicate more of the following prob- 
lems: tardiness, class cutting, physical con- 
flicts, gang activity, robbery or theft, vandal- 
ism, use of alcohol, use of illegal drugs, stu- 
dents under the influence of alcohol or drugs 
while at school, sale of drugs near school, 
possession of weapons, physical or verbal 

Er|( 52 



Region of the Country 

Zero-one dummy variables indicate in which of 
four US Census regions the student attended school 
in 1992, based on G12REGON. 



• Midwest — East North Central and West 
North Central states; 

• Northeast — New England and Middle Atlan- 
tic states; 

• South — South Atlantic, East South Central, 
and West South Central states; and 

• West — Mountain and Pacific States. 
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Urbanlcity 

Zero-one dummy variables indicate the 
urbanicity of the school the student attended in 1992, 
based on G12URBN3. 

• Urban — central city; 

• Suburban — area surrounding a central city 
within a county constituting an Metropolitan 
Statistical Area (MSA); and 

• Rural — outside an MSA. 
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Introduction 



After more than 30 years of research, social sci- 
entists have made little progress in identifying the 
educational production function. "Production func- 
tion" studies are those that use some form of multi- 
variate analysis, such as regression analysis, to mea- 
sure associations between various educational in- 
puts, such as per-pupil expenditures, and outputs, 
such as academic achievement as measured by stan- 
dardized tests. 1 One of the earliest studies of this 
type was the Equality of Educational Opportunity 
Report, commonly referred to as the Coleman Re- 
port (Coleman et al. 1966). This study found little 
association between inputs and outputs for a na- 
tionally representative sample of students and 
schools. Since the publication of the Coleman Re- 
port, nearly 400 additional studies of this sort have 
been conducted. Their results have been mixed, 
fueling, rather than resolving, the debate as to 
whether money matters to educational achievement 
(see Hanushek 1997 for list of studies). 

Because of the mixed results of this very large 
number of studies, some researchers have concluded 
that the production function approach is flawed and 
should be abandoned. In their view, production 
function studies suffer from a multitude of prob- 
lems, including their failure to analyze different 
types of educational expenditure (such as spending 
on instruction and administration) and their failure 
to adjust for regional variations in the cost of edu- 



cation (Fortune and O'Neil 1994). Some research- 
ers suggest alternate approaches to estimating the 
relationship between expenditures and achieve- 
ment. Monk (1992) suggests conducting small-scale 
studies at low levels of aggregation, such as the class- 
room level; Fortune and O'Neil (1994) suggest com- 
paring the achievement levels of specific subgroups, 
such as high-spending and low-spending urban 
school districts. 

This paper contends that the production func- 
tion approach is salvageable; the problems research- 
ers have identified can be addressed, producing 
meaningful results. The present study provides an 
example of how this may be done. It applies struc- 
tural equation modeling and multilevel modeling 
to recently developed databases of fourth-graders. 
The study is national in scope, distinguishes be- 
tween different types of spending, and adjusts for 
regional variations in the cost of education, thus 
addressing many of the issues raised by critics of 
the production function approach. The study finds 
that, at least for fourth-graders, some inputs are 
strongly associated with academic achievement, 
while others are not: Instructional expenditures, 
central office administration expenditures, and 
teacher-student ratios are all associated with 
achievement; principal's office expenditures, capi- 
tal outlays, and teacher education levels are not. 
Before discussing these results and their derivation. 
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For the purposes of this paper, "expenditures" refers to actual dollars spent by school districts, "resources" to quantifiable 
goods made available to schools, and "inputs" to both expenditures and resources. 
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however, it is necessary to touch upon the method- 
ological issues involved in production functions. 

Background 

Production function studies of education have 
been undertaken for more than 30 years. By one 
estimate nearly 400 production function studies 
have been conducted and published since the 
Coleman Report of 1966 (Hanushek 1997, 1996). 
These studies have tended to use samples that are 
smaller in their geographical scope than the national 
Coleman Report, and have studied the same sorts 
of inputs that report did (aggregate per-pupil ex- 
penditures). These studies have come to different 
conclusions regarding the production function, 
some finding relationships between a given input 
and academic achievement, and others finding no 
such relationship. 



sions. 



More recently, studies known as 
"meta-analyses" have applied statis- 
tical techniques to synthesize the 
findings from production function 
studies; these too arrived at contra- 
dictory conclusions. Hanushek 
(1989) conducted a meta-analysis 
covering both expenditure and re- 
source measures, and found no re- 
lationship between these inputs and 
academic achievement. Hanushek 
synthesized the findings of 187 pro- 
duction function studies using the 
technique of vote counting. He first 
divided each study into its compo- 
nent inputs. A study that related 
class size and teacher education to achievement, for 
example, was divided into those two inputs. Each 
input was then placed in one of seven categories: 
per-pupil expenditures, teacher experience, teacher 
education, teacher salary, teacher-student ratio, ad- 
ministrative inputs, and facilities. Within each cat- 
egory, the relationship of the input to the studied 
output was classified as positive and statistically sig- 
nificant, positive and statistically non-significant, 
negative and statistically significant, negative and 
statistically non-significant, and non-significant but 
of unknown direction. Hanushek found most rela- 
tionships to be non-significant. Of 65 aggregate per- 
pupil expenditure relationships, for example, he 



More recently, studies 
known as “meta- 



found 13 to be positive and significant, 3 negative 
and significant, and 49 to be non-significant. He 
concluded that "there is no strong or systematic re- 
lationship between school expenditures and student 
performance (1989, 47)." 

Hedges, Laine, and Greenwald (1994) reana- 
lyzed most of the same studies, and drew the oppo- 
site conclusion. They first excluded from their analy- 
sis the relationships Hanushek had classified as non- 
significant but of unknown direction. For the re- 
maining relationships, they reinterpreted 
Hanushek' s vote counting in the context of rules 
regarding statistical significance. They argued that, 
if the relationships are treated as a sample, in order 
to draw the conclusion that there is no relationship 
between an input and achievement, no more than 5 
percent of the relationships could be significant, and 
these relationships would have to be equally divided 
between the positive and negative 
directions. Yet, in fact, if relation- 
ships of unknown direction are ex- 
cluded, many more than 5 percent 
of the relationships are significant 
(up to 30 percent for per-pupil ex- 
penditures); most of the significant 
relationships are in the positive di- 
rection. The bulk of insignificant 
relationships are also in the posi- 
tive direction. 



analyses" have applied 
statistical techniques 
to synthesize the 
findings from produc- 
tion function studies; 
these too arrived at 
contradictory conclu- 



After reinterpreting the vote 
count. Hedges, Laine, and 
Greenwald (1994) applied a signifi- 
cance test, the inverse chi-square, 
to combine the relationships for 
each input into a single significance measure. They 
tested two hypotheses: that each input is positively 
related to achievement, and that each is negatively 
related to achievement. They found, for the full 
sample of relationships (as well as for various 
subsamples), that almost all relationships were sig- 
nificant in the positive direction, with a few others 
being significant in the negative direction. Finally, 
for each input. Hedges and his colleagues combined 
the coefficients from those studies that reported 
them by calculating their median. They found posi- 
tive coefficients for per-pupil expenditures, teacher 
experience, teacher salary, administrative inputs. 
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and facilities, and mixed results for class size, and 
concluded that resources affect achievement. 

Hanushek (1996) continued the debate, counter- 
ing the meta-analysis of Hedges, Laine, and 
Green wald. He updated his sample of studies to 
include those published after his 1989 meta-analy- 
sis, making a total of 377 studies. Hanushek again 
found, when he classified relationships into the 
seven categories, that the bulk of studies indicated 
no significant relationship between resources and 
achievement. In a counter-study of their own, 
Greenwald, Hedges, and Laine (1996) created their 
own sample of studies, and placed the relationships 
they identified from the studies into seven some- 
what different categories: per-pupil expenditures, 
teacher ability, teacher education, teacher experi- 
ence, teacher salary, teacher-pupil ratio and school 
size. They again found, for both this new sample 
and for various subsamples, that the 
combined significance test and me- 
dian effect sizes supported the hy- 
pothesis that resources affect 
achievement. Most recently, 

Hanushek (1997) has compared his 
sample of 377 studies to the sample 
of Greenwald, Hedges, and Laine 
(1996), and found that the latter 
sample systematically over-repre- 
sented positive relationships. 2 

The fact that different meta- 
analyses can reach different conclu- 
sions from similar sets of studies 
indicates that the underlying stud- 
ies are quite volatile in their results 
when subjected to different assumptions. This vola- 
tility was even revealed within the meta-analyses. 
For instance. Hedges, Laine, and Greenwald (1994) 
were able to find support both for the hypothesis of 
a positive relationship and that of a negative rela- 
tionship between a given resource and achievement 
when using combined significance tests. Both 



Greenwald, Hedges, and Laine (1996) and 
Hanushek (1997) found that the results from a 
subsample of longitudinal studies differed markedly 
from those of the full sample. What the meta-analy- 
ses reveal most clearly, then, is that the original stud- 
ies do not provide a clear answer to the question of 
whether or not money matters. 

This lack of consensus among the meta-analy- 
ses reflects to some degree shortcomings in the meth- 
ods of the original studies. Six shortcomings have 
been commonly noted. 

First, unlike the Coleman Report, most subse- 
quent studies were not nationally representative, but 
instead studied a particular state or school district. 
This hampers development of a consensus, because 
different regions of the country may have different 
spending patterns and different relationships be- 
tween these spending patterns and 
student achievement. 

Second, the studies did not dis- 
tinguish among different types of 
spending. While they measured 
multiple inputs, such as teacher ex- 
perience and teacher-student ra- 
tios, the only expenditure measure 
used was aggregate per-pupil ex- 
penditures. Using such a gross 
measure risks missing certain dy- 
namics in the relationship between 
school spending and academic 
achievement, as increases in some 
types of spending may have an ef- 
fect while increases in others may not. For instance, 
increased spending on administration may not sig- 
nificantly raise achievement, while increased spend- 
ing on instruction may. If these types of spending 
are not measured separately, the apparent effects of 
spending on instruction will be reduced or elimi- 
nated when combined with the lack of effects from 
administration. 
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Other meta-analyses have also arrived at contradictory conclusions. With regard to class size. Glass and Smith (1979) found a 
clear and consistent relationship while Odden (1990) did not. The effect of class size on student achievement has also been the 
subject of a controlled experiment in which students in kindergarten and first grade were randomly assigned to small and 
large classes. The study found significant achievement differences that persisted even after the students in small classes were 
returned to large ones (Finn and Achilles 1990; Mosteller 1995; MosteUer, Light/and Sachs 1996). This finding, like those of 
production function research, has been the subject of controversy (Hanushek 199 7).‘ 

iqI 



The fact that different 
meta-analyses can 
reach different conclu- 
sions from similar sets 
-of studies indicates that 
the underlying studies 
are quite volatile in their 
results when subjected 
to different assump- 
tions. 
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Third, the studies did not take into account the 
ways in which other influences on the process of 
schooling may mediate between spending and 
achievement. Effective schools research suggests 
that certain aspects of the school environment, par- 
ticularly supportive relations between teachers and 
principals, positively influence achievement. 3 Yet 
none of the prior research has sought to measure 
the influence of school spending patterns on school 
environment. 

Fourth, not all of the studies provided rich mea- 
sures of student background. 4 While the research 
on measures of the socio-economic characteristics 
of students indicates that a single measure, socio- 
economic status (SES), can be generated by adding 
together responses to a relatively small number of 
questions, many studies did not include such ques- 
tions. If SES is poorly measured, it is difficult to 
determine if relationships between 
spending and achievement are at- 
tributable to some degree to SES 
differences between students in 
high- and low-spending districts. 

Fifth, most studies did not con- 
trol for variations in cost between 
regions. The cost of living in New 
York City is higher than the cost of 
living in Montgomery, Alabama, 
and presumably this difference 
means that teachers paid the same 
actual dollars in the two cities are 
not able to maintain the same stan- 
dard of living; a dollar will buy less 
in New York City. As a result. New 
York City would have to offer higher salaries to re- 
cruit successfully the same teachers as Montgom- 
ery. 5 Other factors may also influence the cost of 
hiring comparable teachers, including union pres- 



sure to increase wages and the overall quality of life 
in the region. Most studies did not take these fac- 
tors into account, and they may be as important as 
SES, in that differences in achievement between two 
districts may be due to some degree to differences 
in how much it costs to hire teachers. 6 

Sixth, many of the measures of achievement 
used by earlier studies were unsophisticated. Some 
did not use achievement measures at all but merely 
relied on proxies, such as graduation rates. Some 
used measures as simple as whether or not a stu- 
dent passed a minimum competency test. Few took 
into account modem developments in test theory, 
such as Item Response Theory (IRT). 7 

Finally, the prior research has not taken into ac- 
count the multilevel nature of school effects. Mea- 
suring the relationship between school characteris- 
tics and student achievement en- 
tails relating variables whose level 
of analysis is the school or school 
district to an outcome whose level 
of analysis is the student. Various 
estimation techniques have been 
developed that take the multilevel 
nature of school effects into account, 
and it has been found that these 
techniques sometimes produce re- 
sults that differ from more conven- 
tional techniques. In particular, 
conventional techniques often un- 
derestimate standard errors and, in 
some cases, fail to identify impor- 
tant components of school effects 
(Raudenbush and Willms 1995; 
Bryk and Raudenbush 1992). Production function 
models have generally not made use of estimation 
techniques that are sensitive to multilevel data, and 
consequently may produce inaccurate results. 



Effective schools re- 
search suggests that 
certain aspects of the 
school environment, 
particularly supportive 
relations between 
teachers and principals, 
positively influence 
achievement. 



3 Despite some early criticism of effective schools research (e.g., Cuban 1984; Purkey and Smith 1983), later large scale multivariate 
studies have persuaded most researchers that there is a social dimension to school life that plays some independent role in 
student achievement. The extent of this role is, however, still being debated (Lee, Bryk, and Smith 1993). 

4 This was pointed out by Hedges, Laine, and Greenwald (1994, 12). 

5 When cost of living is taken into account, differentials in per-pupil expenditures between high-spending and low-spending 
states decrease markedly, indicating that states with fewer resources often tend to be states with lower costs of living (Barton et 
al. 1991). 

6 This was pointed out by Fortune and O'Neil (1994, 24). 

7 For a discussion of this shortcoming in production function research, see Fortune and O'Neil (1994, 24). For a discussion of 
IRT, see Hambleton et al. (1991). 
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Some researchers doubt that these problems can 
be addressed, and have argued that the production 
function approach should be abandoned, and alter- 
nate approaches explored. Monk (1992) proposes 
to shift the unit of analysis for school resource stud- 
ies to the classroom level. He notes that prior re- 
search has found a great deal of variation in the ef- 
ficacy of teachers within the same school, as well as 
variation in the efficacy of a particular teacher dur- 
ing different classes. He classifies teachers as being 
of two types, those who are engaged with their 
classes, trying actively to address any problems in 
them, and those who are accommodating, seeking 
only to avoid dealing with problems. He views 
whether the teacher chooses the engagement or ac- 
commodation route as dependent upon a number 
of factors, including resource decisions made at the 
school level. Monk calls for a research program in 
which teachers are interviewed to provide retrospec- 
tive information on the problems 
they face, their responses to these 
problems, and the degree to which 
resources operate as a constraint. 



Another alternative to produc- 
tion function research is the thresh- 
old approach, proposed by Fortune 
and O'Neil (1994). They argue that 
the key problem with production 
function research is its use of linear 
models. They hypothesize that in- 
put-output relationships occur in a 
punctuated manner, with small in- 
crements of inputs having no effect 
on achievement, but large incre- 
ments having a large effect. To es- 
timate this effect, they propose comparing the mean 
achievement levels of school districts that are in the 
top 30 percent in terms of spending to school dis- 
tricts that are in the bottom 30 percent. To address 
the problem that demographic variables might be 
at the root of achievement differences, they propose 
using demographically similar school districts for 
the comparison. They also propose eliminating out- 
lying cases. In applying this approach to samples 
of school districts in Missouri and Ohio, they find 



Such alternatives, 
however, raise their 
own methodological 
issues. The most 
significant is that in 
both cases it is diffi- 
cult to separate the 
factors contributing to 
student achievement. 



that while correlation coefficients rarely uncover an 
input-output relationship, the threshold approach 
often finds one. 

Such alternatives, however, raise their own 
methodological issues. The most significant is that 
in both cases it is difficult to separate the factors 
contributing to student achievement. In the class- 
room-based approach, efficacious and non-effica- 
cious teachers are identified and the resource con- 
straints traced. Yet, the availability of one type of 
resource tends to be highly correlated with avail- 
ability of another. Since the efficacious teacher may 
have many resources available at once, it will be 
difficult to determine which is the basis of high 
teacher efficacy. In addition, it will be difficult to 
determine whether high student achievement is 
primarily attributable to teacher efficacy or student 
characteristics. 8 In the threshold approach, only 
school districts with extremely high 
and extremely low levels of expen- 
ditures are compared. In many ar- 
eas the high expenditure districts 
will have high levels of resources 
and the low expenditure districts 
low levels of resources. It will thus 
be difficult to determine which re- 
source is responsible for achieve- 
ment levels. In addition, it will be 
difficult to determine whether or 
not the resource levels in the schools 
or the resource levels in the commu- 
nities account for achievement dif- 
ferences. 9 



Given that these alternate ap- 
proaches raise their own difficulties, it may also be 
worthwhile to salvage the production function ap- 
proach through addressing its problems. The 
present study is an attempt to do just that. 

The Design of the Study 

Hypotheses 

This study hypothesizes that there are various 
potential paths through which school district expen- 



It is also worth noting that to draw conclusions about expenditures and resources, the classroom approach will still have to 
collect district- and school-level data, because expenditure and resource decisions are made at those levels, not the classroom 
level. Thus many of the methodological problems of production function studies will also hold for the classroom approach. 
NCES (1995a) found 'that school district expenditures and the average SES of the districts were strongly related. 
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ditures and school resources can influence student 
achievement (figure 1). These paths occur in three 
basic steps. First, the allocation of money at the 
school district level influences the availability of 
resources at the school level. Most decisions about 
how to spend money are made by school superin- 
tendents and their staffs. These spending decisions 
determine how much of each school resource is pur- 
chased, and therefore what is available in the school. 
For instance, more spending on instruction will lead 
to some combination of more teachers per student, 
higher teacher salaries and more instructional ma- 
terials. 

Second, the availability of resources has conse- 
quences for the school climate. Schools vary widely 
in environment, some possessing low levels of stu- 
dent and teacher absenteeism, collegial relationships 
between teachers and principals, and a lack of dis- 
ruptive and delinquent behaviors, and others pos- 
sessing the opposite. In part, environment is influ- 
enced by the availability of resources; teachers who 



are paid lower salaries, for example, might be ex- 
pected to be more frequently absent. 

The third step involves the influence of school 
climate on student achievement. Effective schools 
research suggests that school climate strongly influ- 
ences student performance (Lee, Bryk, and Smith 
1993; Austin and Garber 1985; Brookover et al. 1979; 
Edmonds 1979). Disruptive students, high levels of 
student and teacher absenteeism, and frayed prin- 
cipal-teacher relations can be expected to interfere 
with the ability of teachers to instruct and students 
to learn. 

It is hypothesized here that some educational 
expenditures influence achievement via these steps. 
Four types of expenditures are considered: instruc- 
tional expenditures, central office administration 
expenditures, principal's office administration ex- 
penditures, and capital outlays. The first two, it is 
hypothesized, will directly affect school resources. 



Figure 1 .—Hypothesized paths to achievement 




SOURCE: Wenglinsky, unpublished. 
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